Predicting 30-day mortality after an acute coronary syndrome (acs) using machine learning methods for feature selection, classification and visualisation

Hybrid combinations of feature selection, classification and visualisation using machine learning (ML) methods have the potential for enhanced understanding and 30-day mortality prediction of patients with cardiovascular disease using population-specific data. Identifying a feature selection method...

Full description

Bibliographic Details
Published in:	Sains Malaysiana
Main Author:	Aziida N.; Malek S.; Aziz F.; Ibrahim K.S.; Kasim S.
Format:	Article
Language:	English
Published:	Penerbit Universiti Kebangsaan Malaysia 2021
Online Access:	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85104477518&doi=10.17576%2fjsm-2021-5003-17&partnerID=40&md5=4338016cc1c3456bba3741d8e1fcfbf0

id	2-s2.0-85104477518
spelling	2-s2.0-85104477518 Aziida N.; Malek S.; Aziz F.; Ibrahim K.S.; Kasim S. Predicting 30-day mortality after an acute coronary syndrome (acs) using machine learning methods for feature selection, classification and visualisation 2021 Sains Malaysiana 50 3 10.17576/jsm-2021-5003-17 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85104477518&doi=10.17576%2fjsm-2021-5003-17&partnerID=40&md5=4338016cc1c3456bba3741d8e1fcfbf0 Hybrid combinations of feature selection, classification and visualisation using machine learning (ML) methods have the potential for enhanced understanding and 30-day mortality prediction of patients with cardiovascular disease using population-specific data. Identifying a feature selection method with a classifier algorithm that produces high performance in mortality studies is essential and has not been reported before. Feature selection methods such as Boruta, Random Forest (RF), Elastic Net (EN), Recursive Feature Elimination (RFE), learning vector quantization (LVQ), Genetic Algorithm (GA), Cluster Dendrogram (CD), Support Vector Machine (SVM) and Logistic Regression (LR) were combined with RF, SVM, LR, and EN classifiers for 30-day mortality prediction. ML models were constructed using 302 patients and 54 input variables from the Malaysian National Cardiovascular Disease Database. Validation of the best ML model was performed against Thrombolysis in Myocardial Infarction (TIMI) using an additional dataset of 102 patients. The Self-Organising Feature Map (SOM) was used to visualise mortality-related factors post-ACS. The performance of ML models using the area under the curve (AUC) ranged from 0.48 to 0.80. The best-performing model (AUC = 0.80) was a hybrid combination of the RF variable importance method, the sequential backward selection and the RF classifier using five predictors (age, triglyceride, creatinine, troponin, and total cholesterol). Comparison with TIMI using an additional dataset resulted in the best ML model outperforming the TIMI score (AUC = 0.75 vs. AUC = 0.60). The findings of this study will provide a basis for developing an online ML-based population-specific risk scoring calculator. © 2021 Penerbit Universiti Kebangsaan Malaysia. All rights reserved. Penerbit Universiti Kebangsaan Malaysia 1266039 English Article All Open Access; Gold Open Access
author	Aziida N.; Malek S.; Aziz F.; Ibrahim K.S.; Kasim S.
spellingShingle	Aziida N.; Malek S.; Aziz F.; Ibrahim K.S.; Kasim S. Predicting 30-day mortality after an acute coronary syndrome (acs) using machine learning methods for feature selection, classification and visualisation
author_facet	Aziida N.; Malek S.; Aziz F.; Ibrahim K.S.; Kasim S.
author_sort	Aziida N.; Malek S.; Aziz F.; Ibrahim K.S.; Kasim S.
title	Predicting 30-day mortality after an acute coronary syndrome (acs) using machine learning methods for feature selection, classification and visualisation
title_short	Predicting 30-day mortality after an acute coronary syndrome (acs) using machine learning methods for feature selection, classification and visualisation
title_full	Predicting 30-day mortality after an acute coronary syndrome (acs) using machine learning methods for feature selection, classification and visualisation
title_fullStr	Predicting 30-day mortality after an acute coronary syndrome (acs) using machine learning methods for feature selection, classification and visualisation
title_full_unstemmed	Predicting 30-day mortality after an acute coronary syndrome (acs) using machine learning methods for feature selection, classification and visualisation
title_sort	Predicting 30-day mortality after an acute coronary syndrome (acs) using machine learning methods for feature selection, classification and visualisation
publishDate	2021
container_title	Sains Malaysiana
container_volume	50
container_issue	3
doi_str_mv	10.17576/jsm-2021-5003-17
url	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85104477518&doi=10.17576%2fjsm-2021-5003-17&partnerID=40&md5=4338016cc1c3456bba3741d8e1fcfbf0
description	Hybrid combinations of feature selection, classification and visualisation using machine learning (ML) methods have the potential for enhanced understanding and 30-day mortality prediction of patients with cardiovascular disease using population-specific data. Identifying a feature selection method with a classifier algorithm that produces high performance in mortality studies is essential and has not been reported before. Feature selection methods such as Boruta, Random Forest (RF), Elastic Net (EN), Recursive Feature Elimination (RFE), learning vector quantization (LVQ), Genetic Algorithm (GA), Cluster Dendrogram (CD), Support Vector Machine (SVM) and Logistic Regression (LR) were combined with RF, SVM, LR, and EN classifiers for 30-day mortality prediction. ML models were constructed using 302 patients and 54 input variables from the Malaysian National Cardiovascular Disease Database. Validation of the best ML model was performed against Thrombolysis in Myocardial Infarction (TIMI) using an additional dataset of 102 patients. The Self-Organising Feature Map (SOM) was used to visualise mortality-related factors post-ACS. The performance of ML models using the area under the curve (AUC) ranged from 0.48 to 0.80. The best-performing model (AUC = 0.80) was a hybrid combination of the RF variable importance method, the sequential backward selection and the RF classifier using five predictors (age, triglyceride, creatinine, troponin, and total cholesterol). Comparison with TIMI using an additional dataset resulted in the best ML model outperforming the TIMI score (AUC = 0.75 vs. AUC = 0.60). The findings of this study will provide a basis for developing an online ML-based population-specific risk scoring calculator. © 2021 Penerbit Universiti Kebangsaan Malaysia. All rights reserved.
publisher	Penerbit Universiti Kebangsaan Malaysia
issn	1266039
language	English
format	Article
accesstype	All Open Access; Gold Open Access
record_format	scopus
collection	Scopus
_version_	1825722582822813696

Predicting 30-day mortality after an acute coronary syndrome (acs) using machine learning methods for feature selection, classification and visualisation

Similar Items