Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia

This article aims to develop an accurate air quality prediction model to handle Jakarta's air pollution challenges. In this study, data from air quality monitoring stations’ conventional air pollution indexes was employed. In the research phase, data is explored, SMOTE is used to manage imbalan...

Full description

Bibliographic Details
Published in:Lecture Notes on Data Engineering and Communications Technologies
Main Author: Wibowo W.; Al Azies H.; Wilujeng S.A.; Abdul-Rahman S.
Format: Book chapter
Language:English
Published: Springer Science and Business Media Deutschland GmbH 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85192770804&doi=10.1007%2f978-981-97-0293-0_24&partnerID=40&md5=852c9d63aa80cc944e5893609885be99
Description
Summary:This article aims to develop an accurate air quality prediction model to handle Jakarta's air pollution challenges. In this study, data from air quality monitoring stations’ conventional air pollution indexes was employed. In the research phase, data is explored, SMOTE is used to manage imbalances, and XGBoost is used to develop a model with the best parameters. The evaluation stage shows the model’s ability to predict air quality. With an accuracy rate of 99.516%, an F1-score of 99.528%, and a recall rate of 99.509%, the results were very astounding. These performance indicators show the model's exceptional ability to classify and predict air quality levels. Furthermore, this study investigates the significance of various variables in predicting air quality. A thorough evaluation of measures such as weight, gain, total gain, and cover indicators reveals the significance of numerous aspects. Even while SO2 helps predict air quality, the prevalence of PM2.5 on several measures reveals a significant influence. This study contributes to a better understanding of the complicated dynamics of air quality prediction by employing advanced analytical approaches and accurate models. This knowledge is useful in developing targeted solutions to address air pollution issues and promote healthier urban environments. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.
ISSN:23674512
DOI:10.1007/978-981-97-0293-0_24