Classification Prediction of Familial Hypercholesterolemia using Ensemble-based Classifier with Feature Selection and Rebalancing Technique

Familial hypercholesterolemia (FH) is the most prevalent hereditary hyperlipidemia. Although FH is a significant risk factor of premature coronary heart disease (CHD), it is treatable if detected early and prompt intervention is given. Nevertheless, most people with FH receive inadequate diagnosis a...

Full description

Bibliographic Details
Published in:International Conference on ICT Convergence
Main Author: Edward J.; Rosli M.M.; Chua Y.-A.; Kasim N.A.M.; Nawawi H.
Format: Conference paper
Language:English
Published: IEEE Computer Society 2022
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85143252351&doi=10.1109%2fICTC55196.2022.9952820&partnerID=40&md5=1077235e63251ba559dee7e9184b1bfc
id 2-s2.0-85143252351
spelling 2-s2.0-85143252351
Edward J.; Rosli M.M.; Chua Y.-A.; Kasim N.A.M.; Nawawi H.
Classification Prediction of Familial Hypercholesterolemia using Ensemble-based Classifier with Feature Selection and Rebalancing Technique
2022
International Conference on ICT Convergence
2022-October

10.1109/ICTC55196.2022.9952820
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85143252351&doi=10.1109%2fICTC55196.2022.9952820&partnerID=40&md5=1077235e63251ba559dee7e9184b1bfc
Familial hypercholesterolemia (FH) is the most prevalent hereditary hyperlipidemia. Although FH is a significant risk factor of premature coronary heart disease (CHD), it is treatable if detected early and prompt intervention is given. Nevertheless, most people with FH receive inadequate diagnosis and treatment, which results in missed opportunities for premature CHD prevention. Therefore, an efficient and faster method of diagnosing FH is crucial for early identification among Malaysians, especially in this age of technology. This study aims to evaluate the performance of ensemble-based classifier and rebalancing strategy with Synthetic Minority Oversampling Technique (SMOTE) towards FH diagnosis in the Malaysian population. Our proposed ensemble-based classifier consists of a combination decision tree, random forest, extreme gradient boosting, ensemble-based classifier using majority voting technique. We also applied Recursive Feature Elimination (RFE) to identify significant features across three well-known diagnostic tools. Experimental findings demonstrate that our proposed ensembled-based classifier with RFE and SMOTE, considerably outperforms the baseline by 99.32% in terms of accuracy, precision, recall, micro-average, macro-average, and G-mean. The proposed ensemble-based classifier with RFE approach selected the same significant features of FH for each of the three diagnostic criteria. We hope that the ensemble-based classifier will aid early detection of FH among Malaysian population and can be used as predictive tool for future studies. © 2022 IEEE.
IEEE Computer Society
21621233
English
Conference paper

author Edward J.; Rosli M.M.; Chua Y.-A.; Kasim N.A.M.; Nawawi H.
spellingShingle Edward J.; Rosli M.M.; Chua Y.-A.; Kasim N.A.M.; Nawawi H.
Classification Prediction of Familial Hypercholesterolemia using Ensemble-based Classifier with Feature Selection and Rebalancing Technique
author_facet Edward J.; Rosli M.M.; Chua Y.-A.; Kasim N.A.M.; Nawawi H.
author_sort Edward J.; Rosli M.M.; Chua Y.-A.; Kasim N.A.M.; Nawawi H.
title Classification Prediction of Familial Hypercholesterolemia using Ensemble-based Classifier with Feature Selection and Rebalancing Technique
title_short Classification Prediction of Familial Hypercholesterolemia using Ensemble-based Classifier with Feature Selection and Rebalancing Technique
title_full Classification Prediction of Familial Hypercholesterolemia using Ensemble-based Classifier with Feature Selection and Rebalancing Technique
title_fullStr Classification Prediction of Familial Hypercholesterolemia using Ensemble-based Classifier with Feature Selection and Rebalancing Technique
title_full_unstemmed Classification Prediction of Familial Hypercholesterolemia using Ensemble-based Classifier with Feature Selection and Rebalancing Technique
title_sort Classification Prediction of Familial Hypercholesterolemia using Ensemble-based Classifier with Feature Selection and Rebalancing Technique
publishDate 2022
container_title International Conference on ICT Convergence
container_volume 2022-October
container_issue
doi_str_mv 10.1109/ICTC55196.2022.9952820
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85143252351&doi=10.1109%2fICTC55196.2022.9952820&partnerID=40&md5=1077235e63251ba559dee7e9184b1bfc
description Familial hypercholesterolemia (FH) is the most prevalent hereditary hyperlipidemia. Although FH is a significant risk factor of premature coronary heart disease (CHD), it is treatable if detected early and prompt intervention is given. Nevertheless, most people with FH receive inadequate diagnosis and treatment, which results in missed opportunities for premature CHD prevention. Therefore, an efficient and faster method of diagnosing FH is crucial for early identification among Malaysians, especially in this age of technology. This study aims to evaluate the performance of ensemble-based classifier and rebalancing strategy with Synthetic Minority Oversampling Technique (SMOTE) towards FH diagnosis in the Malaysian population. Our proposed ensemble-based classifier consists of a combination decision tree, random forest, extreme gradient boosting, ensemble-based classifier using majority voting technique. We also applied Recursive Feature Elimination (RFE) to identify significant features across three well-known diagnostic tools. Experimental findings demonstrate that our proposed ensembled-based classifier with RFE and SMOTE, considerably outperforms the baseline by 99.32% in terms of accuracy, precision, recall, micro-average, macro-average, and G-mean. The proposed ensemble-based classifier with RFE approach selected the same significant features of FH for each of the three diagnostic criteria. We hope that the ensemble-based classifier will aid early detection of FH among Malaysian population and can be used as predictive tool for future studies. © 2022 IEEE.
publisher IEEE Computer Society
issn 21621233
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1809678026037788672