Genetic Programming Based Automated Machine Learning in Classifying ESG Performances

AutoML offers significant benefits in solving real-life problems because it accelerates the development of machine learning models. In contexts involving real scenarios like analyzing companies’ environmental, social and governance (ESG), where the dataset presents some challenges, AutoML is anticip...

Full description

Bibliographic Details
Published in:IEEE Access
Main Author: Rahman A.S.A.; Masrom S.; Rahman R.A.; Ibrahim R.; Gilal A.R.
Format: Article
Language:English
Published: Institute of Electrical and Electronics Engineers Inc. 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85191844347&doi=10.1109%2fACCESS.2024.3393511&partnerID=40&md5=fd3057ede4d6e3d897c000247b746d5b
id 2-s2.0-85191844347
spelling 2-s2.0-85191844347
Rahman A.S.A.; Masrom S.; Rahman R.A.; Ibrahim R.; Gilal A.R.
Genetic Programming Based Automated Machine Learning in Classifying ESG Performances
2024
IEEE Access
12

10.1109/ACCESS.2024.3393511
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85191844347&doi=10.1109%2fACCESS.2024.3393511&partnerID=40&md5=fd3057ede4d6e3d897c000247b746d5b
AutoML offers significant benefits in solving real-life problems because it accelerates the development of machine learning models. In contexts involving real scenarios like analyzing companies’ environmental, social and governance (ESG), where the dataset presents some challenges, AutoML is anticipated as a promising solution to address these complexities. Although researchers have shown significant interest in exploring Genetic Programming (GP) in AutoML for handling complex datasets, a critical issue that remains unresolved is the comprehensive understanding of GP hyper-parameters that influence machine learning performance. While GP-based AutoML excels in automating many aspects of the modelling, there has been a scarcity of research that provides insight into the significance of individual features and GP population size within the models of GP-based AutoML. This paper presents a comprehensive analysis of the models’ performance evaluation from multiple facets, including feature selection, GP population sizes, and different machine learning algorithms. Furthermore, this study provides insights into the association between Pearson correlations, machine learning performance, and the importance of machine learning features. The findings demonstrate that incorporating all the determinants as features in GP-based AutoML or relying solely on firm characteristics led to superior performance with an excellent trade-off between True Positive Rate and False Positive Rate. Thus, higher accuracy results exceeding 0.9 of Area Under the Curve (AUC) are presented by the proposed models. The novelty of this study lies in its empirical evaluation of different approaches to GP-based AutoML implementation. These findings provide alternative solutions for business investors to identify companies with strong sustainability practices. © 2024 The Authors.
Institute of Electrical and Electronics Engineers Inc.
21693536
English
Article
All Open Access; Gold Open Access
author Rahman A.S.A.; Masrom S.; Rahman R.A.; Ibrahim R.; Gilal A.R.
spellingShingle Rahman A.S.A.; Masrom S.; Rahman R.A.; Ibrahim R.; Gilal A.R.
Genetic Programming Based Automated Machine Learning in Classifying ESG Performances
author_facet Rahman A.S.A.; Masrom S.; Rahman R.A.; Ibrahim R.; Gilal A.R.
author_sort Rahman A.S.A.; Masrom S.; Rahman R.A.; Ibrahim R.; Gilal A.R.
title Genetic Programming Based Automated Machine Learning in Classifying ESG Performances
title_short Genetic Programming Based Automated Machine Learning in Classifying ESG Performances
title_full Genetic Programming Based Automated Machine Learning in Classifying ESG Performances
title_fullStr Genetic Programming Based Automated Machine Learning in Classifying ESG Performances
title_full_unstemmed Genetic Programming Based Automated Machine Learning in Classifying ESG Performances
title_sort Genetic Programming Based Automated Machine Learning in Classifying ESG Performances
publishDate 2024
container_title IEEE Access
container_volume 12
container_issue
doi_str_mv 10.1109/ACCESS.2024.3393511
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85191844347&doi=10.1109%2fACCESS.2024.3393511&partnerID=40&md5=fd3057ede4d6e3d897c000247b746d5b
description AutoML offers significant benefits in solving real-life problems because it accelerates the development of machine learning models. In contexts involving real scenarios like analyzing companies’ environmental, social and governance (ESG), where the dataset presents some challenges, AutoML is anticipated as a promising solution to address these complexities. Although researchers have shown significant interest in exploring Genetic Programming (GP) in AutoML for handling complex datasets, a critical issue that remains unresolved is the comprehensive understanding of GP hyper-parameters that influence machine learning performance. While GP-based AutoML excels in automating many aspects of the modelling, there has been a scarcity of research that provides insight into the significance of individual features and GP population size within the models of GP-based AutoML. This paper presents a comprehensive analysis of the models’ performance evaluation from multiple facets, including feature selection, GP population sizes, and different machine learning algorithms. Furthermore, this study provides insights into the association between Pearson correlations, machine learning performance, and the importance of machine learning features. The findings demonstrate that incorporating all the determinants as features in GP-based AutoML or relying solely on firm characteristics led to superior performance with an excellent trade-off between True Positive Rate and False Positive Rate. Thus, higher accuracy results exceeding 0.9 of Area Under the Curve (AUC) are presented by the proposed models. The novelty of this study lies in its empirical evaluation of different approaches to GP-based AutoML implementation. These findings provide alternative solutions for business investors to identify companies with strong sustainability practices. © 2024 The Authors.
publisher Institute of Electrical and Electronics Engineers Inc.
issn 21693536
language English
format Article
accesstype All Open Access; Gold Open Access
record_format scopus
collection Scopus
_version_ 1809678155454087168