Machine learning prediction for academic misconduct prediction: an analysis of binary classification metrics

Academic misconduct is unethical behavior in academic work. To sustain integrity culture and mitigating unethical conducts among higher education institutions community, the academic misconduct detection must be done at an earlier stage. Thus, this study attempted to provide a new empirical contribu...

Full description

Bibliographic Details
Published in:Bulletin of Electrical Engineering and Informatics
Main Author: Masrom S.; Samad N.H.A.; Septiyanti R.; Roslan N.; Rahman R.A.
Format: Article
Language:English
Published: Institute of Advanced Engineering and Science 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85186193430&doi=10.11591%2feei.v13i1.5629&partnerID=40&md5=37a64235fe558381037c184de240ca08
id 2-s2.0-85186193430
spelling 2-s2.0-85186193430
Masrom S.; Samad N.H.A.; Septiyanti R.; Roslan N.; Rahman R.A.
Machine learning prediction for academic misconduct prediction: an analysis of binary classification metrics
2024
Bulletin of Electrical Engineering and Informatics
13
1
10.11591/eei.v13i1.5629
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85186193430&doi=10.11591%2feei.v13i1.5629&partnerID=40&md5=37a64235fe558381037c184de240ca08
Academic misconduct is unethical behavior in academic work. To sustain integrity culture and mitigating unethical conducts among higher education institutions community, the academic misconduct detection must be done at an earlier stage. Thus, this study attempted to provide a new empirical contribution with the analysis of binary classification performances metrics to describe the ability of machine learning in predicting academic misconduct. Four machine learning algorithms have been used namely generalized linear model (GLM), logistic regression (LR), decision tree (DT), and random forest (RF). Beside performances comparison, this paper presents the analysis of academic misconduct factors that were constructed based on demography and fraud triangle theory (FTT). The findings showed that all the four machine learning algorithms have obtained good ability in the prediction models with the accuracy at above 80% and below 20% of the classification errors. Rationalization from the FTT attributes has shown as the most important factor in GLM, LR, and DT. In RF, opportunity of FTT attributes have become the most important. Compared to FTT attributes, demography attributes were not providing much benefits to all the machine learning models but remain applicable at very low weight correlations. © 2024, Institute of Advanced Engineering and Science. All rights reserved.
Institute of Advanced Engineering and Science
20893191
English
Article
All Open Access; Gold Open Access
author Masrom S.; Samad N.H.A.; Septiyanti R.; Roslan N.; Rahman R.A.
spellingShingle Masrom S.; Samad N.H.A.; Septiyanti R.; Roslan N.; Rahman R.A.
Machine learning prediction for academic misconduct prediction: an analysis of binary classification metrics
author_facet Masrom S.; Samad N.H.A.; Septiyanti R.; Roslan N.; Rahman R.A.
author_sort Masrom S.; Samad N.H.A.; Septiyanti R.; Roslan N.; Rahman R.A.
title Machine learning prediction for academic misconduct prediction: an analysis of binary classification metrics
title_short Machine learning prediction for academic misconduct prediction: an analysis of binary classification metrics
title_full Machine learning prediction for academic misconduct prediction: an analysis of binary classification metrics
title_fullStr Machine learning prediction for academic misconduct prediction: an analysis of binary classification metrics
title_full_unstemmed Machine learning prediction for academic misconduct prediction: an analysis of binary classification metrics
title_sort Machine learning prediction for academic misconduct prediction: an analysis of binary classification metrics
publishDate 2024
container_title Bulletin of Electrical Engineering and Informatics
container_volume 13
container_issue 1
doi_str_mv 10.11591/eei.v13i1.5629
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85186193430&doi=10.11591%2feei.v13i1.5629&partnerID=40&md5=37a64235fe558381037c184de240ca08
description Academic misconduct is unethical behavior in academic work. To sustain integrity culture and mitigating unethical conducts among higher education institutions community, the academic misconduct detection must be done at an earlier stage. Thus, this study attempted to provide a new empirical contribution with the analysis of binary classification performances metrics to describe the ability of machine learning in predicting academic misconduct. Four machine learning algorithms have been used namely generalized linear model (GLM), logistic regression (LR), decision tree (DT), and random forest (RF). Beside performances comparison, this paper presents the analysis of academic misconduct factors that were constructed based on demography and fraud triangle theory (FTT). The findings showed that all the four machine learning algorithms have obtained good ability in the prediction models with the accuracy at above 80% and below 20% of the classification errors. Rationalization from the FTT attributes has shown as the most important factor in GLM, LR, and DT. In RF, opportunity of FTT attributes have become the most important. Compared to FTT attributes, demography attributes were not providing much benefits to all the machine learning models but remain applicable at very low weight correlations. © 2024, Institute of Advanced Engineering and Science. All rights reserved.
publisher Institute of Advanced Engineering and Science
issn 20893191
language English
format Article
accesstype All Open Access; Gold Open Access
record_format scopus
collection Scopus
_version_ 1809678011196243968