Genetic Programming Based Automated Machine Learning in Classifying ESG Performances

AutoML offers significant benefits in solving real-life problems because it accelerates the development of machine learning models. In contexts involving real scenarios like analyzing companies' environmental, social and governance (ESG), where the dataset presents some challenges, AutoML is an...

Full description

Bibliographic Details
Published in:IEEE ACCESS
Main Authors: Abd Rahman, Abdullah Sani; Masrom, Suraya; Rahman, Rahayu Abdul; Ibrahim, Roslina; Gilal, Abdul Rehman
Format: Article
Language:English
Published: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC 2024
Subjects:
Online Access:https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001214326000001
author Abd Rahman
Abdullah Sani; Masrom
Suraya; Rahman
Rahayu Abdul; Ibrahim
Roslina; Gilal
Abdul Rehman
spellingShingle Abd Rahman
Abdullah Sani; Masrom
Suraya; Rahman
Rahayu Abdul; Ibrahim
Roslina; Gilal
Abdul Rehman
Genetic Programming Based Automated Machine Learning in Classifying ESG Performances
Computer Science; Engineering; Telecommunications
author_facet Abd Rahman
Abdullah Sani; Masrom
Suraya; Rahman
Rahayu Abdul; Ibrahim
Roslina; Gilal
Abdul Rehman
author_sort Abd Rahman
spelling Abd Rahman, Abdullah Sani; Masrom, Suraya; Rahman, Rahayu Abdul; Ibrahim, Roslina; Gilal, Abdul Rehman
Genetic Programming Based Automated Machine Learning in Classifying ESG Performances
IEEE ACCESS
English
Article
AutoML offers significant benefits in solving real-life problems because it accelerates the development of machine learning models. In contexts involving real scenarios like analyzing companies' environmental, social and governance (ESG), where the dataset presents some challenges, AutoML is anticipated as a promising solution to address these complexities. Although researchers have shown significant interest in exploring Genetic Programming (GP) in AutoML for handling complex datasets, a critical issue that remains unresolved is the comprehensive understanding of GP hyper-parameters that influence machine learning performance. While GP-based AutoML excels in automating many aspects of the modelling, there has been a scarcity of research that provides insight into the significance of individual features and GP population size within the models of GP-based AutoML. This paper presents a comprehensive analysis of the models' performance evaluation from multiple facets, including feature selection, GP population sizes, and different machine learning algorithms. Furthermore, this study provides insights into the association between Pearson correlations, machine learning performance, and the importance of machine learning features. The findings demonstrate that incorporating all the determinants as features in GP-based AutoML or relying solely on firm characteristics led to superior performance with an excellent trade-off between True Positive Rate and False Positive Rate. Thus, higher accuracy results exceeding 0.9 of Area Under the Curve (AUC) are presented by the proposed models. The novelty of this study lies in its empirical evaluation of different approaches to GP-based AutoML implementation. These findings provide alternative solutions for business investors to identify companies with strong sustainability practices.
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
2169-3536

2024
12

10.1109/ACCESS.2024.3393511
Computer Science; Engineering; Telecommunications
gold
WOS:001214326000001
https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001214326000001
title Genetic Programming Based Automated Machine Learning in Classifying ESG Performances
title_short Genetic Programming Based Automated Machine Learning in Classifying ESG Performances
title_full Genetic Programming Based Automated Machine Learning in Classifying ESG Performances
title_fullStr Genetic Programming Based Automated Machine Learning in Classifying ESG Performances
title_full_unstemmed Genetic Programming Based Automated Machine Learning in Classifying ESG Performances
title_sort Genetic Programming Based Automated Machine Learning in Classifying ESG Performances
container_title IEEE ACCESS
language English
format Article
description AutoML offers significant benefits in solving real-life problems because it accelerates the development of machine learning models. In contexts involving real scenarios like analyzing companies' environmental, social and governance (ESG), where the dataset presents some challenges, AutoML is anticipated as a promising solution to address these complexities. Although researchers have shown significant interest in exploring Genetic Programming (GP) in AutoML for handling complex datasets, a critical issue that remains unresolved is the comprehensive understanding of GP hyper-parameters that influence machine learning performance. While GP-based AutoML excels in automating many aspects of the modelling, there has been a scarcity of research that provides insight into the significance of individual features and GP population size within the models of GP-based AutoML. This paper presents a comprehensive analysis of the models' performance evaluation from multiple facets, including feature selection, GP population sizes, and different machine learning algorithms. Furthermore, this study provides insights into the association between Pearson correlations, machine learning performance, and the importance of machine learning features. The findings demonstrate that incorporating all the determinants as features in GP-based AutoML or relying solely on firm characteristics led to superior performance with an excellent trade-off between True Positive Rate and False Positive Rate. Thus, higher accuracy results exceeding 0.9 of Area Under the Curve (AUC) are presented by the proposed models. The novelty of this study lies in its empirical evaluation of different approaches to GP-based AutoML implementation. These findings provide alternative solutions for business investors to identify companies with strong sustainability practices.
publisher IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
issn 2169-3536

publishDate 2024
container_volume 12
container_issue
doi_str_mv 10.1109/ACCESS.2024.3393511
topic Computer Science; Engineering; Telecommunications
topic_facet Computer Science; Engineering; Telecommunications
accesstype gold
id WOS:001214326000001
url https://www-webofscience-com.uitm.idm.oclc.org/wos/woscc/full-record/WOS:001214326000001
record_format wos
collection Web of Science (WoS)
_version_ 1809679005339615232