Multi-Class Imbalanced Data Classification: A Systematic Mapping Study

Multi-class data classification is distinguished as a significant and challenging research topic in contemporary machine learning, particularly when concerning imbalanced data sets. Hence, a thorough investigation of multi-class imbalanced data classification is becoming increasingly pertinent. In t...

Full description

Bibliographic Details
Published in:Engineering, Technology and Applied Science Research
Main Author: Wang Y.; Rosli M.M.; Musa N.; Li F.
Format: Article
Language:English
Published: Dr D. Pylarinos 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85196059492&doi=10.48084%2fetasr.7206&partnerID=40&md5=a9f9d75d9111bd0a4c2d14a705d8535d
id 2-s2.0-85196059492
spelling 2-s2.0-85196059492
Wang Y.; Rosli M.M.; Musa N.; Li F.
Multi-Class Imbalanced Data Classification: A Systematic Mapping Study
2024
Engineering, Technology and Applied Science Research
14
3
10.48084/etasr.7206
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85196059492&doi=10.48084%2fetasr.7206&partnerID=40&md5=a9f9d75d9111bd0a4c2d14a705d8535d
Multi-class data classification is distinguished as a significant and challenging research topic in contemporary machine learning, particularly when concerning imbalanced data sets. Hence, a thorough investigation of multi-class imbalanced data classification is becoming increasingly pertinent. In this paper, an overview of multi-class imbalanced data classification was generated via conducting a systematic mapping study, which endeavors to analyze the state of contemporary multi-class imbalanced data classification, with the primary goal of ascertaining the corpus of research undertaken in machine learning. To achieve this aim, 7,164 papers were assessed and the 147 prominent ones were selected from five digital libraries, which were further categorized according to techniques, issues, and types of datasets. After a thorough review of these papers, a taxonomy of multi-class imbalanced data classification techniques is proposed. Based on the results, researchers widely employ algorithmic-level, ensemble, and oversampling strategies to address the issue of multi-class imbalance in medical datasets, primarily to mitigate the impact of challenging data factors. This research highlights an urgent need for more studies on multi-class imbalanced data classification. © by the authors.
Dr D. Pylarinos
22414487
English
Article
All Open Access; Gold Open Access
author Wang Y.; Rosli M.M.; Musa N.; Li F.
spellingShingle Wang Y.; Rosli M.M.; Musa N.; Li F.
Multi-Class Imbalanced Data Classification: A Systematic Mapping Study
author_facet Wang Y.; Rosli M.M.; Musa N.; Li F.
author_sort Wang Y.; Rosli M.M.; Musa N.; Li F.
title Multi-Class Imbalanced Data Classification: A Systematic Mapping Study
title_short Multi-Class Imbalanced Data Classification: A Systematic Mapping Study
title_full Multi-Class Imbalanced Data Classification: A Systematic Mapping Study
title_fullStr Multi-Class Imbalanced Data Classification: A Systematic Mapping Study
title_full_unstemmed Multi-Class Imbalanced Data Classification: A Systematic Mapping Study
title_sort Multi-Class Imbalanced Data Classification: A Systematic Mapping Study
publishDate 2024
container_title Engineering, Technology and Applied Science Research
container_volume 14
container_issue 3
doi_str_mv 10.48084/etasr.7206
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85196059492&doi=10.48084%2fetasr.7206&partnerID=40&md5=a9f9d75d9111bd0a4c2d14a705d8535d
description Multi-class data classification is distinguished as a significant and challenging research topic in contemporary machine learning, particularly when concerning imbalanced data sets. Hence, a thorough investigation of multi-class imbalanced data classification is becoming increasingly pertinent. In this paper, an overview of multi-class imbalanced data classification was generated via conducting a systematic mapping study, which endeavors to analyze the state of contemporary multi-class imbalanced data classification, with the primary goal of ascertaining the corpus of research undertaken in machine learning. To achieve this aim, 7,164 papers were assessed and the 147 prominent ones were selected from five digital libraries, which were further categorized according to techniques, issues, and types of datasets. After a thorough review of these papers, a taxonomy of multi-class imbalanced data classification techniques is proposed. Based on the results, researchers widely employ algorithmic-level, ensemble, and oversampling strategies to address the issue of multi-class imbalance in medical datasets, primarily to mitigate the impact of challenging data factors. This research highlights an urgent need for more studies on multi-class imbalanced data classification. © by the authors.
publisher Dr D. Pylarinos
issn 22414487
language English
format Article
accesstype All Open Access; Gold Open Access
record_format scopus
collection Scopus
_version_ 1812871795054739456