A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier

In medical data, addressing imbalanced datasets is paramount for accurate predictive modeling. This paper delves into exploring a well-established rebalancing framework proposed in previous research. While acknowledged for its effectiveness, the adaptability of this framework across diverse medical...

Full description

Bibliographic Details
Published in:PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY
Main Authors: Edward, Jafhate; Rosli, Marshima Mohd; Seman, Ali
Format: Article
Language:English
Published: UNIV PUTRA MALAYSIA PRESS 2024
Subjects:
Online Access:https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001343703200011
author Edward
Jafhate; Rosli
Marshima Mohd; Seman
Ali
spellingShingle Edward
Jafhate; Rosli
Marshima Mohd; Seman
Ali
A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier
Science & Technology - Other Topics
author_facet Edward
Jafhate; Rosli
Marshima Mohd; Seman
Ali
author_sort Edward
spelling Edward, Jafhate; Rosli, Marshima Mohd; Seman, Ali
A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier
PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY
English
Article
In medical data, addressing imbalanced datasets is paramount for accurate predictive modeling. This paper delves into exploring a well-established rebalancing framework proposed in previous research. While acknowledged for its effectiveness, the adaptability of this framework across diverse medical datasets remains unexplored. We conduct a comprehensive investigation to bridge this gap by integrating an ensemble-based classifier into the existing framework. By leveraging seven imbalanced medical binary datasets, our study comprises three distinct experiments: utilizing standard baseline classifiers from the framework (original), incorporating the baseline with an ensemble-based classifier, and introducing our novel ensemble-based classifier with the self-paced ensemble (SPE) algorithm. Our novel ensemble, composed of decision tree (DT), radial support vector machine (R.SVM), and extreme gradient boosting (XGB) classifiers, serves as the foundation for the SPE. Our primary objective is to demonstrate the potential improvement of the existing framework's overall performance through the integration of an ensemble. Experimental results reveal significant enhancements, with our proposed ensemble classifier outperforming the original by 4.96%, 5.89%, 5.68%, 7.85%, and 6.84% in terms of accuracy, precision, recall, F-score, and G-mean, respectively. This study contributes valuable insights into the adaptability and performance augmentation achievable through ensemble methods in addressing class imbalances within the medical domain.
UNIV PUTRA MALAYSIA PRESS
0128-7680

2024
32
6
10.47836/pjst.32.6.12
Science & Technology - Other Topics

WOS:001343703200011
https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001343703200011
title A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier
title_short A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier
title_full A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier
title_fullStr A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier
title_full_unstemmed A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier
title_sort A Comprehensive Analysis of a Framework for Rebalancing Imbalanced Medical Data Using an Ensemble-based Classifier
container_title PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY
language English
format Article
description In medical data, addressing imbalanced datasets is paramount for accurate predictive modeling. This paper delves into exploring a well-established rebalancing framework proposed in previous research. While acknowledged for its effectiveness, the adaptability of this framework across diverse medical datasets remains unexplored. We conduct a comprehensive investigation to bridge this gap by integrating an ensemble-based classifier into the existing framework. By leveraging seven imbalanced medical binary datasets, our study comprises three distinct experiments: utilizing standard baseline classifiers from the framework (original), incorporating the baseline with an ensemble-based classifier, and introducing our novel ensemble-based classifier with the self-paced ensemble (SPE) algorithm. Our novel ensemble, composed of decision tree (DT), radial support vector machine (R.SVM), and extreme gradient boosting (XGB) classifiers, serves as the foundation for the SPE. Our primary objective is to demonstrate the potential improvement of the existing framework's overall performance through the integration of an ensemble. Experimental results reveal significant enhancements, with our proposed ensemble classifier outperforming the original by 4.96%, 5.89%, 5.68%, 7.85%, and 6.84% in terms of accuracy, precision, recall, F-score, and G-mean, respectively. This study contributes valuable insights into the adaptability and performance augmentation achievable through ensemble methods in addressing class imbalances within the medical domain.
publisher UNIV PUTRA MALAYSIA PRESS
issn 0128-7680

publishDate 2024
container_volume 32
container_issue 6
doi_str_mv 10.47836/pjst.32.6.12
topic Science & Technology - Other Topics
topic_facet Science & Technology - Other Topics
accesstype
id WOS:001343703200011
url https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001343703200011
record_format wos
collection Web of Science (WoS)
_version_ 1818940498309545984