Feature selection techniques and classification algorithms for student performance classification: a review

The process of categorizing students’ performance based on input data, encompassing demographic information and final exam results, is recognized as student performance classification. Educational data mining has gained traction in assessing students’ performance. However, this study entails the nee...

Full description

Bibliographic Details
Published in:International Journal of Electrical and Computer Engineering
Main Author: Alias M.A.H.; Hambali N.; Aziz M.A.A.; Taib M.N.; Jailani R.
Format: Review
Language:English
Published: Institute of Advanced Engineering and Science 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85191019150&doi=10.11591%2fijece.v14i3.pp3230-3243&partnerID=40&md5=85c1032ece796edad187afee26234778
Description
Summary:The process of categorizing students’ performance based on input data, encompassing demographic information and final exam results, is recognized as student performance classification. Educational data mining has gained traction in assessing students’ performance. However, this study entails the need to analyze the diverse attributes of students’ information within an educational institution by using data mining techniques. This study thoroughly examines both previous and current methodologies presented by researchers, addressing two main aspects: data preprocessing and classification algorithms applied in student performance classification. Data preprocessing specifically delves into the exploration of feature selection techniques, encompassing three types of feature selection and search methods. These techniques aim to identify the most significant features, eliminate unnecessary ones, and reduce data dimensionality. In addition, classification algorithms play a crucial role in categorizing or predicting student performance. Models such as k-nearest neighbors (KNN), decision tree (DT), artificial neural networks (ANN), and linear models (LR) were scrutinized based on their performance in prior research. Ultimately, this study highlights the potential for further exploration of feature selection techniques like information gain, Chi-square, and sequential selection, particularly when applied to new datasets such as students’ online learning activities, utilizing a variety of classification algorithms. © 2024 Institute of Advanced Engineering and Science. All rights reserved.
ISSN:20888708
DOI:10.11591/ijece.v14i3.pp3230-3243