Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia

This article aims to develop an accurate air quality prediction model to handle Jakarta's air pollution challenges. In this study, data from air quality monitoring stations’ conventional air pollution indexes was employed. In the research phase, data is explored, SMOTE is used to manage imbalan...

Full description

Bibliographic Details
Published in:Lecture Notes on Data Engineering and Communications Technologies
Main Author: Wibowo W.; Al Azies H.; Wilujeng S.A.; Abdul-Rahman S.
Format: Book chapter
Language:English
Published: Springer Science and Business Media Deutschland GmbH 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85192770804&doi=10.1007%2f978-981-97-0293-0_24&partnerID=40&md5=852c9d63aa80cc944e5893609885be99
id 2-s2.0-85192770804
spelling 2-s2.0-85192770804
Wibowo W.; Al Azies H.; Wilujeng S.A.; Abdul-Rahman S.
Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia
2024
Lecture Notes on Data Engineering and Communications Technologies
191

10.1007/978-981-97-0293-0_24
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85192770804&doi=10.1007%2f978-981-97-0293-0_24&partnerID=40&md5=852c9d63aa80cc944e5893609885be99
This article aims to develop an accurate air quality prediction model to handle Jakarta's air pollution challenges. In this study, data from air quality monitoring stations’ conventional air pollution indexes was employed. In the research phase, data is explored, SMOTE is used to manage imbalances, and XGBoost is used to develop a model with the best parameters. The evaluation stage shows the model’s ability to predict air quality. With an accuracy rate of 99.516%, an F1-score of 99.528%, and a recall rate of 99.509%, the results were very astounding. These performance indicators show the model's exceptional ability to classify and predict air quality levels. Furthermore, this study investigates the significance of various variables in predicting air quality. A thorough evaluation of measures such as weight, gain, total gain, and cover indicators reveals the significance of numerous aspects. Even while SO2 helps predict air quality, the prevalence of PM2.5 on several measures reveals a significant influence. This study contributes to a better understanding of the complicated dynamics of air quality prediction by employing advanced analytical approaches and accurate models. This knowledge is useful in developing targeted solutions to address air pollution issues and promote healthier urban environments. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.
Springer Science and Business Media Deutschland GmbH
23674512
English
Book chapter

author Wibowo W.; Al Azies H.; Wilujeng S.A.; Abdul-Rahman S.
spellingShingle Wibowo W.; Al Azies H.; Wilujeng S.A.; Abdul-Rahman S.
Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia
author_facet Wibowo W.; Al Azies H.; Wilujeng S.A.; Abdul-Rahman S.
author_sort Wibowo W.; Al Azies H.; Wilujeng S.A.; Abdul-Rahman S.
title Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia
title_short Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia
title_full Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia
title_fullStr Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia
title_full_unstemmed Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia
title_sort Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia
publishDate 2024
container_title Lecture Notes on Data Engineering and Communications Technologies
container_volume 191
container_issue
doi_str_mv 10.1007/978-981-97-0293-0_24
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85192770804&doi=10.1007%2f978-981-97-0293-0_24&partnerID=40&md5=852c9d63aa80cc944e5893609885be99
description This article aims to develop an accurate air quality prediction model to handle Jakarta's air pollution challenges. In this study, data from air quality monitoring stations’ conventional air pollution indexes was employed. In the research phase, data is explored, SMOTE is used to manage imbalances, and XGBoost is used to develop a model with the best parameters. The evaluation stage shows the model’s ability to predict air quality. With an accuracy rate of 99.516%, an F1-score of 99.528%, and a recall rate of 99.509%, the results were very astounding. These performance indicators show the model's exceptional ability to classify and predict air quality levels. Furthermore, this study investigates the significance of various variables in predicting air quality. A thorough evaluation of measures such as weight, gain, total gain, and cover indicators reveals the significance of numerous aspects. Even while SO2 helps predict air quality, the prevalence of PM2.5 on several measures reveals a significant influence. This study contributes to a better understanding of the complicated dynamics of air quality prediction by employing advanced analytical approaches and accurate models. This knowledge is useful in developing targeted solutions to address air pollution issues and promote healthier urban environments. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.
publisher Springer Science and Business Media Deutschland GmbH
issn 23674512
language English
format Book chapter
accesstype
record_format scopus
collection Scopus
_version_ 1809678154913021952