An Integrated Model for Predicting Student Achievement Efficiency Using Data Envelopment Analysis and Genetic Programming Approach

Data Envelopment Analysis (DEA) is effective in evaluating efficiency across various domains, including education. However, traditional DEA methods face challenges such as computational complexity and limited predictive capabilities. This study addresses these limitations by integrating Automated Ma...

Full description

Bibliographic Details
Published in:2024 5th International Conference on Artificial Intelligence and Data Sciences, AiDAS 2024 - Proceedings
Main Author: Razi N.F.M.; Masrom S.; Baharun N.; Azmi A.Z.
Format: Conference paper
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85209627690&doi=10.1109%2fAiDAS63860.2024.10729908&partnerID=40&md5=1d0021cc47d69a93669ed71c1eca997d
id 2-s2.0-85209627690
spelling 2-s2.0-85209627690
Razi N.F.M.; Masrom S.; Baharun N.; Azmi A.Z.
An Integrated Model for Predicting Student Achievement Efficiency Using Data Envelopment Analysis and Genetic Programming Approach
2024
2024 5th International Conference on Artificial Intelligence and Data Sciences, AiDAS 2024 - Proceedings


10.1109/AiDAS63860.2024.10729908
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85209627690&doi=10.1109%2fAiDAS63860.2024.10729908&partnerID=40&md5=1d0021cc47d69a93669ed71c1eca997d
Data Envelopment Analysis (DEA) is effective in evaluating efficiency across various domains, including education. However, traditional DEA methods face challenges such as computational complexity and limited predictive capabilities. This study addresses these limitations by integrating Automated Machine Learning (AutoML) using Genetic Programming (GP) with DEA to enhance the prediction and efficiency analysis of academic performance among final year diploma students. The objective is to develop and evaluate an integrated framework that leverages AutoML and DEA to improve the accuracy and generalization of efficiency predictions. The study investigates the impact of varying population sizes, mutation rates, and crossover rates within the AutoML framework on model performance. Methodology involves collecting data from 1,099 final year diploma students through surveys measuring their CGPA, satisfaction, and various competencies. Four CCR and BCC models were constructed using different input-output combinations, with Model BCC1 identified as the best model. Cross-validation was employed to mitigate overfitting. Result revealed increasing population sizes generally improved performance, with error values decreasing for both datasets. The best-performing pipelines included RandomForestRegressor and XGBRegressor model by highlighting their robustness and adaptability. In conclusion, integrating GP-based AutoML with DEA provides a promising approach to enhancing efficiency analysis of academic performance. Future research should focus on addressing overfitting through other strategies such as regularization and extending this methodology to other domains for further validation and improvement. © 2024 IEEE.
Institute of Electrical and Electronics Engineers Inc.

English
Conference paper

author Razi N.F.M.; Masrom S.; Baharun N.; Azmi A.Z.
spellingShingle Razi N.F.M.; Masrom S.; Baharun N.; Azmi A.Z.
An Integrated Model for Predicting Student Achievement Efficiency Using Data Envelopment Analysis and Genetic Programming Approach
author_facet Razi N.F.M.; Masrom S.; Baharun N.; Azmi A.Z.
author_sort Razi N.F.M.; Masrom S.; Baharun N.; Azmi A.Z.
title An Integrated Model for Predicting Student Achievement Efficiency Using Data Envelopment Analysis and Genetic Programming Approach
title_short An Integrated Model for Predicting Student Achievement Efficiency Using Data Envelopment Analysis and Genetic Programming Approach
title_full An Integrated Model for Predicting Student Achievement Efficiency Using Data Envelopment Analysis and Genetic Programming Approach
title_fullStr An Integrated Model for Predicting Student Achievement Efficiency Using Data Envelopment Analysis and Genetic Programming Approach
title_full_unstemmed An Integrated Model for Predicting Student Achievement Efficiency Using Data Envelopment Analysis and Genetic Programming Approach
title_sort An Integrated Model for Predicting Student Achievement Efficiency Using Data Envelopment Analysis and Genetic Programming Approach
publishDate 2024
container_title 2024 5th International Conference on Artificial Intelligence and Data Sciences, AiDAS 2024 - Proceedings
container_volume
container_issue
doi_str_mv 10.1109/AiDAS63860.2024.10729908
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85209627690&doi=10.1109%2fAiDAS63860.2024.10729908&partnerID=40&md5=1d0021cc47d69a93669ed71c1eca997d
description Data Envelopment Analysis (DEA) is effective in evaluating efficiency across various domains, including education. However, traditional DEA methods face challenges such as computational complexity and limited predictive capabilities. This study addresses these limitations by integrating Automated Machine Learning (AutoML) using Genetic Programming (GP) with DEA to enhance the prediction and efficiency analysis of academic performance among final year diploma students. The objective is to develop and evaluate an integrated framework that leverages AutoML and DEA to improve the accuracy and generalization of efficiency predictions. The study investigates the impact of varying population sizes, mutation rates, and crossover rates within the AutoML framework on model performance. Methodology involves collecting data from 1,099 final year diploma students through surveys measuring their CGPA, satisfaction, and various competencies. Four CCR and BCC models were constructed using different input-output combinations, with Model BCC1 identified as the best model. Cross-validation was employed to mitigate overfitting. Result revealed increasing population sizes generally improved performance, with error values decreasing for both datasets. The best-performing pipelines included RandomForestRegressor and XGBRegressor model by highlighting their robustness and adaptability. In conclusion, integrating GP-based AutoML with DEA provides a promising approach to enhancing efficiency analysis of academic performance. Future research should focus on addressing overfitting through other strategies such as regularization and extending this methodology to other domains for further validation and improvement. © 2024 IEEE.
publisher Institute of Electrical and Electronics Engineers Inc.
issn
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1820775439940976640