Multi-Class Imbalanced Data Classification: A Systematic Mapping Study
Multi-class data classification is distinguished as a significant and challenging research topic in contemporary machine learning, particularly when concerning imbalanced data sets. Hence, a thorough investigation of multi-class imbalanced data classification is becoming increasingly pertinent. In t...
Published in: | Engineering, Technology and Applied Science Research |
---|---|
Main Author: | |
Format: | Article |
Language: | English |
Published: |
Dr D. Pylarinos
2024
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85196059492&doi=10.48084%2fetasr.7206&partnerID=40&md5=a9f9d75d9111bd0a4c2d14a705d8535d |
id |
2-s2.0-85196059492 |
---|---|
spelling |
2-s2.0-85196059492 Wang Y.; Rosli M.M.; Musa N.; Li F. Multi-Class Imbalanced Data Classification: A Systematic Mapping Study 2024 Engineering, Technology and Applied Science Research 14 3 10.48084/etasr.7206 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85196059492&doi=10.48084%2fetasr.7206&partnerID=40&md5=a9f9d75d9111bd0a4c2d14a705d8535d Multi-class data classification is distinguished as a significant and challenging research topic in contemporary machine learning, particularly when concerning imbalanced data sets. Hence, a thorough investigation of multi-class imbalanced data classification is becoming increasingly pertinent. In this paper, an overview of multi-class imbalanced data classification was generated via conducting a systematic mapping study, which endeavors to analyze the state of contemporary multi-class imbalanced data classification, with the primary goal of ascertaining the corpus of research undertaken in machine learning. To achieve this aim, 7,164 papers were assessed and the 147 prominent ones were selected from five digital libraries, which were further categorized according to techniques, issues, and types of datasets. After a thorough review of these papers, a taxonomy of multi-class imbalanced data classification techniques is proposed. Based on the results, researchers widely employ algorithmic-level, ensemble, and oversampling strategies to address the issue of multi-class imbalance in medical datasets, primarily to mitigate the impact of challenging data factors. This research highlights an urgent need for more studies on multi-class imbalanced data classification. © by the authors. Dr D. Pylarinos 22414487 English Article All Open Access; Gold Open Access |
author |
Wang Y.; Rosli M.M.; Musa N.; Li F. |
spellingShingle |
Wang Y.; Rosli M.M.; Musa N.; Li F. Multi-Class Imbalanced Data Classification: A Systematic Mapping Study |
author_facet |
Wang Y.; Rosli M.M.; Musa N.; Li F. |
author_sort |
Wang Y.; Rosli M.M.; Musa N.; Li F. |
title |
Multi-Class Imbalanced Data Classification: A Systematic Mapping Study |
title_short |
Multi-Class Imbalanced Data Classification: A Systematic Mapping Study |
title_full |
Multi-Class Imbalanced Data Classification: A Systematic Mapping Study |
title_fullStr |
Multi-Class Imbalanced Data Classification: A Systematic Mapping Study |
title_full_unstemmed |
Multi-Class Imbalanced Data Classification: A Systematic Mapping Study |
title_sort |
Multi-Class Imbalanced Data Classification: A Systematic Mapping Study |
publishDate |
2024 |
container_title |
Engineering, Technology and Applied Science Research |
container_volume |
14 |
container_issue |
3 |
doi_str_mv |
10.48084/etasr.7206 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85196059492&doi=10.48084%2fetasr.7206&partnerID=40&md5=a9f9d75d9111bd0a4c2d14a705d8535d |
description |
Multi-class data classification is distinguished as a significant and challenging research topic in contemporary machine learning, particularly when concerning imbalanced data sets. Hence, a thorough investigation of multi-class imbalanced data classification is becoming increasingly pertinent. In this paper, an overview of multi-class imbalanced data classification was generated via conducting a systematic mapping study, which endeavors to analyze the state of contemporary multi-class imbalanced data classification, with the primary goal of ascertaining the corpus of research undertaken in machine learning. To achieve this aim, 7,164 papers were assessed and the 147 prominent ones were selected from five digital libraries, which were further categorized according to techniques, issues, and types of datasets. After a thorough review of these papers, a taxonomy of multi-class imbalanced data classification techniques is proposed. Based on the results, researchers widely employ algorithmic-level, ensemble, and oversampling strategies to address the issue of multi-class imbalance in medical datasets, primarily to mitigate the impact of challenging data factors. This research highlights an urgent need for more studies on multi-class imbalanced data classification. © by the authors. |
publisher |
Dr D. Pylarinos |
issn |
22414487 |
language |
English |
format |
Article |
accesstype |
All Open Access; Gold Open Access |
record_format |
scopus |
collection |
Scopus |
_version_ |
1812871795054739456 |