Development of machine learning algorithms in student performance classification based on online learning activities
The field of educational data mining has gained significant traction for its pivotal role in assessing students' academic achievements. However, to ensure the compatibility of algorithms with the selected dataset, it is imperative for a comprehensive analysis of the algorithms to be done. This...
Published in: | International Journal of Electrical and Computer Engineering |
---|---|
Main Author: | |
Format: | Article |
Language: | English |
Published: |
Institute of Advanced Engineering and Science
2024
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85206088644&doi=10.11591%2fijece.v14i6.pp7126-7136&partnerID=40&md5=53a004de08c33d835afcde4b91a18fdd |
id |
2-s2.0-85206088644 |
---|---|
spelling |
2-s2.0-85206088644 Alias M.A.H.; Aziz M.A.A.; Hambali N.; Taib M.N. Development of machine learning algorithms in student performance classification based on online learning activities 2024 International Journal of Electrical and Computer Engineering 14 6 10.11591/ijece.v14i6.pp7126-7136 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85206088644&doi=10.11591%2fijece.v14i6.pp7126-7136&partnerID=40&md5=53a004de08c33d835afcde4b91a18fdd The field of educational data mining has gained significant traction for its pivotal role in assessing students' academic achievements. However, to ensure the compatibility of algorithms with the selected dataset, it is imperative for a comprehensive analysis of the algorithms to be done. This study delved into the development of machine learning algorithms utilizing students' online learning activities to effectively classify their academic performance. In the data cleaning stage, we employed VarianceThreshold for discarding features that have all zeros. Feature selection and oversampling techniques were integrated into the data preprocessing, using information gain to facilitate efficient feature selection and synthetic minority oversampling technique (SMOTE) to address class imbalance. In the classification phase, three supervised machine learning algorithms: k-nearest neighbors (KNN), multi-layer perceptron (MLP), and logistic regression (LR) were implemented, with 3-fold cross-validation to enhance robustness. Classifiers’ performance underwent refinement through hyperparameter tuning via GridSearchCV. Evaluation metrics, encompassing accuracy, precision, recall, and F1-score, were meticulously measured for each classifier. Notably, the study revealed that both MLP and LR achieved impeccable scores of 100% across all metrics, while KNN exhibited a noticeable performance boost after using hyperparameter tuning. © 2024 Institute of Advanced Engineering and Science. All rights reserved. Institute of Advanced Engineering and Science 20888708 English Article |
author |
Alias M.A.H.; Aziz M.A.A.; Hambali N.; Taib M.N. |
spellingShingle |
Alias M.A.H.; Aziz M.A.A.; Hambali N.; Taib M.N. Development of machine learning algorithms in student performance classification based on online learning activities |
author_facet |
Alias M.A.H.; Aziz M.A.A.; Hambali N.; Taib M.N. |
author_sort |
Alias M.A.H.; Aziz M.A.A.; Hambali N.; Taib M.N. |
title |
Development of machine learning algorithms in student performance classification based on online learning activities |
title_short |
Development of machine learning algorithms in student performance classification based on online learning activities |
title_full |
Development of machine learning algorithms in student performance classification based on online learning activities |
title_fullStr |
Development of machine learning algorithms in student performance classification based on online learning activities |
title_full_unstemmed |
Development of machine learning algorithms in student performance classification based on online learning activities |
title_sort |
Development of machine learning algorithms in student performance classification based on online learning activities |
publishDate |
2024 |
container_title |
International Journal of Electrical and Computer Engineering |
container_volume |
14 |
container_issue |
6 |
doi_str_mv |
10.11591/ijece.v14i6.pp7126-7136 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85206088644&doi=10.11591%2fijece.v14i6.pp7126-7136&partnerID=40&md5=53a004de08c33d835afcde4b91a18fdd |
description |
The field of educational data mining has gained significant traction for its pivotal role in assessing students' academic achievements. However, to ensure the compatibility of algorithms with the selected dataset, it is imperative for a comprehensive analysis of the algorithms to be done. This study delved into the development of machine learning algorithms utilizing students' online learning activities to effectively classify their academic performance. In the data cleaning stage, we employed VarianceThreshold for discarding features that have all zeros. Feature selection and oversampling techniques were integrated into the data preprocessing, using information gain to facilitate efficient feature selection and synthetic minority oversampling technique (SMOTE) to address class imbalance. In the classification phase, three supervised machine learning algorithms: k-nearest neighbors (KNN), multi-layer perceptron (MLP), and logistic regression (LR) were implemented, with 3-fold cross-validation to enhance robustness. Classifiers’ performance underwent refinement through hyperparameter tuning via GridSearchCV. Evaluation metrics, encompassing accuracy, precision, recall, and F1-score, were meticulously measured for each classifier. Notably, the study revealed that both MLP and LR achieved impeccable scores of 100% across all metrics, while KNN exhibited a noticeable performance boost after using hyperparameter tuning. © 2024 Institute of Advanced Engineering and Science. All rights reserved. |
publisher |
Institute of Advanced Engineering and Science |
issn |
20888708 |
language |
English |
format |
Article |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1814778497052180480 |