A Comparative Study of Two Machine Learning Algorithms for Heart Disease Prediction System

Increasing cases of heart disease cause more people to wait to be diagnosed, leading to the inefficiency of the diagnosing process in terms of time and human labor required. However, this problem can be alleviated if a prediction mechanism is in place. The purpose of this study is to predict whether...

Full description

Bibliographic Details
Published in:2021 IEEE 12th Control and System Graduate Research Colloquium, ICSGRC 2021 - Proceedings
Main Author: Azizan W.A.H.W.; Rahim A.A.A.; Hassan S.L.M.; Halim I.S.A.; Abdullah N.E.
Format: Conference paper
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2021
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85114601719&doi=10.1109%2fICSGRC53186.2021.9515250&partnerID=40&md5=d884fff47d269b49f595324ea9d550e5
id 2-s2.0-85114601719
spelling 2-s2.0-85114601719
Azizan W.A.H.W.; Rahim A.A.A.; Hassan S.L.M.; Halim I.S.A.; Abdullah N.E.
A Comparative Study of Two Machine Learning Algorithms for Heart Disease Prediction System
2021
2021 IEEE 12th Control and System Graduate Research Colloquium, ICSGRC 2021 - Proceedings


10.1109/ICSGRC53186.2021.9515250
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85114601719&doi=10.1109%2fICSGRC53186.2021.9515250&partnerID=40&md5=d884fff47d269b49f595324ea9d550e5
Increasing cases of heart disease cause more people to wait to be diagnosed, leading to the inefficiency of the diagnosing process in terms of time and human labor required. However, this problem can be alleviated if a prediction mechanism is in place. The purpose of this study is to predict whether a person has heart disease or not. The prediction system will use machine learning algorithms to select attributes obtained from the Cleveland dataset. Prediction is made using two machine learning models, Artificial Neural Network (ANN) and Logistic Regression. Different sizes of hidden layers and activation functions are used to find the hyperparameters with optimal performance. The number of inputs and outputs are kept constant at one with a maximum iteration of 500. Logistic Regression is used to classify a discrete data set and return the probability value where the Sigmoid function acts as the cost function. Finally, a confusion matrix was used to compare the performance of both models. ANN resulted in higher accuracy of 92.31% and an F1-score of 93.2% compared to Logistic Regression with 90.11% accuracy and an F1-score of 91.26%. © 2021 IEEE.
Institute of Electrical and Electronics Engineers Inc.

English
Conference paper

author Azizan W.A.H.W.; Rahim A.A.A.; Hassan S.L.M.; Halim I.S.A.; Abdullah N.E.
spellingShingle Azizan W.A.H.W.; Rahim A.A.A.; Hassan S.L.M.; Halim I.S.A.; Abdullah N.E.
A Comparative Study of Two Machine Learning Algorithms for Heart Disease Prediction System
author_facet Azizan W.A.H.W.; Rahim A.A.A.; Hassan S.L.M.; Halim I.S.A.; Abdullah N.E.
author_sort Azizan W.A.H.W.; Rahim A.A.A.; Hassan S.L.M.; Halim I.S.A.; Abdullah N.E.
title A Comparative Study of Two Machine Learning Algorithms for Heart Disease Prediction System
title_short A Comparative Study of Two Machine Learning Algorithms for Heart Disease Prediction System
title_full A Comparative Study of Two Machine Learning Algorithms for Heart Disease Prediction System
title_fullStr A Comparative Study of Two Machine Learning Algorithms for Heart Disease Prediction System
title_full_unstemmed A Comparative Study of Two Machine Learning Algorithms for Heart Disease Prediction System
title_sort A Comparative Study of Two Machine Learning Algorithms for Heart Disease Prediction System
publishDate 2021
container_title 2021 IEEE 12th Control and System Graduate Research Colloquium, ICSGRC 2021 - Proceedings
container_volume
container_issue
doi_str_mv 10.1109/ICSGRC53186.2021.9515250
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85114601719&doi=10.1109%2fICSGRC53186.2021.9515250&partnerID=40&md5=d884fff47d269b49f595324ea9d550e5
description Increasing cases of heart disease cause more people to wait to be diagnosed, leading to the inefficiency of the diagnosing process in terms of time and human labor required. However, this problem can be alleviated if a prediction mechanism is in place. The purpose of this study is to predict whether a person has heart disease or not. The prediction system will use machine learning algorithms to select attributes obtained from the Cleveland dataset. Prediction is made using two machine learning models, Artificial Neural Network (ANN) and Logistic Regression. Different sizes of hidden layers and activation functions are used to find the hyperparameters with optimal performance. The number of inputs and outputs are kept constant at one with a maximum iteration of 500. Logistic Regression is used to classify a discrete data set and return the probability value where the Sigmoid function acts as the cost function. Finally, a confusion matrix was used to compare the performance of both models. ANN resulted in higher accuracy of 92.31% and an F1-score of 93.2% compared to Logistic Regression with 90.11% accuracy and an F1-score of 91.26%. © 2021 IEEE.
publisher Institute of Electrical and Electronics Engineers Inc.
issn
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1818940561115054080