A Comparative Study of Two Machine Learning Algorithms for Heart Disease Prediction System
Rising numbers of heart disease cases mean that more people wait to be diagnosed, which makes the diagnostic process inefficient in terms of the time and human labor required. However, this problem can be alleviated if a prediction mechanism is in place. The purpose of this study is to predict whether...
Published in: | 2021 IEEE 12th Control and System Graduate Research Colloquium, ICSGRC 2021 - Proceedings |
---|---|
Main Author: | Azizan W.A.H.W. |
Format: | Conference paper |
Language: | English |
Published: | Institute of Electrical and Electronics Engineers Inc., 2021 |
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85114601719&doi=10.1109%2fICSGRC53186.2021.9515250&partnerID=40&md5=d884fff47d269b49f595324ea9d550e5 |
id |
2-s2.0-85114601719 |
---|---|
author |
Azizan W.A.H.W.; Rahim A.A.A.; Hassan S.L.M.; Halim I.S.A.; Abdullah N.E. |
title |
A Comparative Study of Two Machine Learning Algorithms for Heart Disease Prediction System |
publishDate |
2021 |
container_title |
2021 IEEE 12th Control and System Graduate Research Colloquium, ICSGRC 2021 - Proceedings |
doi_str_mv |
10.1109/ICSGRC53186.2021.9515250 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85114601719&doi=10.1109%2fICSGRC53186.2021.9515250&partnerID=40&md5=d884fff47d269b49f595324ea9d550e5 |
description |
Rising numbers of heart disease cases mean that more people wait to be diagnosed, which makes the diagnostic process inefficient in terms of the time and human labor required. However, this problem can be alleviated if a prediction mechanism is in place. The purpose of this study is to predict whether a person has heart disease or not. The prediction system uses machine learning algorithms to select attributes obtained from the Cleveland dataset. Prediction is made using two machine learning models, Artificial Neural Network (ANN) and Logistic Regression. Different hidden layer sizes and activation functions are tested to find the hyperparameters that give optimal performance. The numbers of inputs and outputs are each kept constant at one, with a maximum of 500 iterations. Logistic Regression is used to classify a discrete data set and return the probability value where the Sigmoid function acts as the cost function. Finally, a confusion matrix is used to compare the performance of both models. ANN achieved a higher accuracy of 92.31% and an F1-score of 93.2%, compared with Logistic Regression at 90.11% accuracy and an F1-score of 91.26%. © 2021 IEEE. |
publisher |
Institute of Electrical and Electronics Engineers Inc. |
language |
English |
format |
Conference paper |
record_format |
scopus |
collection |
Scopus |
_version_ |
1818940561115054080 |
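
The description above compares the two models through a confusion matrix, accuracy, and F1-score. As a minimal sketch of how those two metrics follow from the four confusion-matrix counts, the Python below may help. The class name `ConfusionMatrix`, its field names, and the example counts are assumptions made for illustration; they are not taken from the paper, and the counts are not the paper's results.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionMatrix:
    """Counts for a binary classifier (positive = heart disease present).

    Hypothetical helper for illustration; not the authors' implementation.
    """
    tp: int  # true positives
    fp: int  # false positives
    fn: int  # false negatives
    tn: int  # true negatives

    def accuracy(self) -> float:
        # Share of all predictions that were correct.
        total = self.tp + self.fp + self.fn + self.tn
        return (self.tp + self.tn) / total

    def precision(self) -> float:
        # Share of predicted positives that were truly positive.
        return self.tp / (self.tp + self.fp)

    def recall(self) -> float:
        # Share of actual positives that were detected.
        return self.tp / (self.tp + self.fn)

    def f1(self) -> float:
        # Harmonic mean of precision and recall.
        p, r = self.precision(), self.recall()
        return 2 * p * r / (p + r)


# Made-up counts for illustration only; NOT the paper's results.
example = ConfusionMatrix(tp=40, fp=5, fn=10, tn=45)
print(f"accuracy={example.accuracy():.4f} f1={example.f1():.4f}")
```

The sketch does not guard against division by zero, so it assumes at least one predicted positive and one actual positive. Reporting F1 alongside accuracy is useful when false positives and false negatives carry different costs, as they typically do in a diagnostic setting.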