Classification Prediction of Familial Hypercholesterolemia using Ensemble-based Classifier with Feature Selection and Rebalancing Technique

Bibliographic Details
Published in: International Conference on ICT Convergence
Authors: Edward J.; Rosli M.M.; Chua Y.-A.; Kasim N.A.M.; Nawawi H.
Format: Conference paper
Language: English
Published: IEEE Computer Society 2022
Online Access: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85143252351&doi=10.1109%2fICTC55196.2022.9952820&partnerID=40&md5=1077235e63251ba559dee7e9184b1bfc
Description
Summary: Familial hypercholesterolemia (FH) is the most prevalent hereditary hyperlipidemia. Although FH is a significant risk factor for premature coronary heart disease (CHD), it is treatable if it is detected early and prompt intervention is given. Nevertheless, most people with FH receive inadequate diagnosis and treatment, which results in missed opportunities for premature CHD prevention. Therefore, an efficient and faster method of diagnosing FH is crucial for early identification among Malaysians, especially in this age of technology. This study aims to evaluate the performance of an ensemble-based classifier and a rebalancing strategy using the Synthetic Minority Oversampling Technique (SMOTE) for FH diagnosis in the Malaysian population. Our proposed ensemble-based classifier combines decision tree, random forest, and extreme gradient boosting classifiers using a majority voting technique. We also applied Recursive Feature Elimination (RFE) to identify significant features across three well-known diagnostic tools. Experimental findings demonstrate that our proposed ensemble-based classifier with RFE and SMOTE considerably outperforms the baseline by 99.32% in terms of accuracy, precision, recall, micro-average, macro-average, and G-mean. The proposed ensemble-based classifier with the RFE approach selected the same significant features of FH for each of the three diagnostic criteria. We hope that the ensemble-based classifier will aid early detection of FH among the Malaysian population and can be used as a predictive tool in future studies. © 2022 IEEE.
ISSN: 2162-1233
DOI: 10.1109/ICTC55196.2022.9952820
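
Illustrative sketch: The summary names three building blocks that are easy to state concretely: majority voting over several classifiers, the G-mean metric, and SMOTE-style oversampling. The Python below is a minimal, standard-library-only sketch of those three ideas, not the authors' implementation. The function names (majority_vote, g_mean, smote_point) and the toy labels are hypothetical and exist only for illustration, and RFE is not shown. The record does not say which software the study used; in practice, libraries such as scikit-learn (VotingClassifier, RFE) and imbalanced-learn (SMOTE) provide full versions of these techniques.

```python
import math
import random
from collections import Counter


def majority_vote(predictions):
    """Hard voting: pick the most common label per sample.

    `predictions` is a list of per-model label lists of equal length.
    """
    voted = []
    for labels in zip(*predictions):
        # most_common(1) returns [(label, count)]; on a tie, the label
        # encountered first (i.e. from the earliest model) is kept.
        voted.append(Counter(labels).most_common(1)[0][0])
    return voted


def g_mean(y_true, y_pred):
    """Geometric mean of the per-class recalls."""
    recalls = []
    for c in sorted(set(y_true)):
        total = sum(1 for t in y_true if t == c)
        hits = sum(1 for t, p in zip(y_true, y_pred) if t == c and p == c)
        recalls.append(hits / total)
    return math.prod(recalls) ** (1.0 / len(recalls))


def smote_point(x, neighbor, rng):
    """SMOTE-style synthetic sample: a random point on the segment
    between a minority-class sample `x` and one of its minority-class
    neighbours. Neighbour search is omitted in this sketch."""
    gap = rng.random()  # single interpolation factor in [0, 1)
    return [a + gap * (b - a) for a, b in zip(x, neighbor)]


# Toy labels (invented for illustration, not data from the study).
tree_pred = [1, 0, 1, 1]
forest_pred = [1, 0, 0, 1]
boost_pred = [0, 0, 1, 1]
y_true = [1, 0, 1, 0]

ensemble_pred = majority_vote([tree_pred, forest_pred, boost_pred])
score = g_mean(y_true, ensemble_pred)
synthetic = smote_point([0.0, 0.0], [2.0, 4.0], random.Random(0))
```

Because G-mean multiplies the per-class recalls, a classifier that ignores the minority class scores near zero however high its accuracy is, which is why the metric is commonly reported alongside rebalancing methods such as SMOTE.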