A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier
In medical data, addressing imbalanced datasets is paramount for accurate predictive modeling. This paper delves into exploring a well-established rebalancing framework proposed in previous research. While acknowledged for its effectiveness, the adaptability of this framework across diverse medical...
Published in: | PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY |
---|---|
Main Authors: | , , , |
Format: | Article |
Language: | English |
Published: |
UNIV PUTRA MALAYSIA PRESS
2024
|
Subjects: | |
Online Access: | https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001343703200011 |
author |
Edward Jafhate; Rosli Marshima Mohd; Seman Ali |
---|---|
spellingShingle |
Edward Jafhate; Rosli Marshima Mohd; Seman Ali A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier Science & Technology - Other Topics |
author_facet |
Edward Jafhate; Rosli Marshima Mohd; Seman Ali |
author_sort |
Edward |
spelling |
Edward, Jafhate; Rosli, Marshima Mohd; Seman, Ali A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY English Article In medical data, addressing imbalanced datasets is paramount for accurate predictive modeling. This paper delves into exploring a well-established rebalancing framework proposed in previous research. While acknowledged for its effectiveness, the adaptability of this framework across diverse medical datasets remains unexplored. We conduct a comprehensive investigation to bridge this gap by integrating an ensemble-based classifier into the existing framework. By leveraging seven imbalanced medical binary datasets, our study comprises three distinct experiments: utilizing standard baseline classifiers from the framework (original), incorporating the baseline with an ensemble-based classifier, and introducing our novel ensemble-based classifier with the self-paced ensemble (SPE) algorithm. Our novel ensemble, composed of decision tree (DT), radial support vector machine (R.SVM), and extreme gradient boosting (XGB) classifiers, serves as the foundation for the SPE. Our primary objective is to demonstrate the potential improvement of the existing framework's overall performance through the integration of an ensemble. Experimental results reveal significant enhancements, with our proposed ensemble classifier outperforming the original by 4.96%, 5.89%, 5.68%, 7.85%, and 6.84% in terms of accuracy, precision, recall, F-score, and G-mean, respectively. This study contributes valuable insights into the adaptability and performance augmentation achievable through ensemble methods in addressing class imbalances within the medical domain. UNIV PUTRA MALAYSIA PRESS 0128-7680 2024 32 6 10.47836/pjst.32.6.12 Science & Technology - Other Topics WOS:001343703200011 https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001343703200011 |
title |
A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier |
title_short |
A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier |
title_full |
A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier |
title_fullStr |
A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier |
title_full_unstemmed |
A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier |
title_sort |
A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier |
container_title |
PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY |
language |
English |
format |
Article |
description |
In medical data, addressing imbalanced datasets is paramount for accurate predictive modeling. This paper delves into exploring a well-established rebalancing framework proposed in previous research. While acknowledged for its effectiveness, the adaptability of this framework across diverse medical datasets remains unexplored. We conduct a comprehensive investigation to bridge this gap by integrating an ensemble-based classifier into the existing framework. By leveraging seven imbalanced medical binary datasets, our study comprises three distinct experiments: utilizing standard baseline classifiers from the framework (original), incorporating the baseline with an ensemble-based classifier, and introducing our novel ensemble-based classifier with the self-paced ensemble (SPE) algorithm. Our novel ensemble, composed of decision tree (DT), radial support vector machine (R.SVM), and extreme gradient boosting (XGB) classifiers, serves as the foundation for the SPE. Our primary objective is to demonstrate the potential improvement of the existing framework's overall performance through the integration of an ensemble. Experimental results reveal significant enhancements, with our proposed ensemble classifier outperforming the original by 4.96%, 5.89%, 5.68%, 7.85%, and 6.84% in terms of accuracy, precision, recall, F-score, and G-mean, respectively. This study contributes valuable insights into the adaptability and performance augmentation achievable through ensemble methods in addressing class imbalances within the medical domain. |
publisher |
UNIV PUTRA MALAYSIA PRESS |
issn |
0128-7680 |
publishDate |
2024 |
container_volume |
32 |
container_issue |
6 |
doi_str_mv |
10.47836/pjst.32.6.12 |
topic |
Science & Technology - Other Topics |
topic_facet |
Science & Technology - Other Topics |
accesstype |
|
id |
WOS:001343703200011 |
url |
https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001343703200011 |
record_format |
wos |
collection |
Web of Science (WoS) |
_version_ |
1818940498309545984 |