An Integrated Model for Predicting Student Achievement Efficiency Using Data Envelopment Analysis and Genetic Programming Approach

Data Envelopment Analysis (DEA) is effective in evaluating efficiency across various domains, including education. However, traditional DEA methods face challenges such as computational complexity and limited predictive capabilities. This study addresses these limitations by integrating Automated Ma...

Full description

Bibliographic Details
Published in:2024 5th International Conference on Artificial Intelligence and Data Sciences, AiDAS 2024 - Proceedings
Main Author: Razi N.F.M.; Masrom S.; Baharun N.; Azmi A.Z.
Format: Conference paper
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85209627690&doi=10.1109%2fAiDAS63860.2024.10729908&partnerID=40&md5=1d0021cc47d69a93669ed71c1eca997d
Description
Summary:Data Envelopment Analysis (DEA) is effective in evaluating efficiency across various domains, including education. However, traditional DEA methods face challenges such as computational complexity and limited predictive capabilities. This study addresses these limitations by integrating Automated Machine Learning (AutoML) using Genetic Programming (GP) with DEA to enhance the prediction and efficiency analysis of academic performance among final year diploma students. The objective is to develop and evaluate an integrated framework that leverages AutoML and DEA to improve the accuracy and generalization of efficiency predictions. The study investigates the impact of varying population sizes, mutation rates, and crossover rates within the AutoML framework on model performance. Methodology involves collecting data from 1,099 final year diploma students through surveys measuring their CGPA, satisfaction, and various competencies. Four CCR and BCC models were constructed using different input-output combinations, with Model BCC1 identified as the best model. Cross-validation was employed to mitigate overfitting. Result revealed increasing population sizes generally improved performance, with error values decreasing for both datasets. The best-performing pipelines included RandomForestRegressor and XGBRegressor model by highlighting their robustness and adaptability. In conclusion, integrating GP-based AutoML with DEA provides a promising approach to enhancing efficiency analysis of academic performance. Future research should focus on addressing overfitting through other strategies such as regularization and extending this methodology to other domains for further validation and improvement. © 2024 IEEE.
ISSN:
DOI:10.1109/AiDAS63860.2024.10729908