Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia
This article aims to develop an accurate air quality prediction model to handle Jakarta's air pollution challenges. In this study, data from air quality monitoring stations’ conventional air pollution indexes was employed. In the research phase, data is explored, SMOTE is used to manage imbalan...
Published in: | Lecture Notes on Data Engineering and Communications Technologies |
---|---|
Main Author: | |
Format: | Book chapter |
Language: | English |
Published: |
Springer Science and Business Media Deutschland GmbH
2024
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85192770804&doi=10.1007%2f978-981-97-0293-0_24&partnerID=40&md5=852c9d63aa80cc944e5893609885be99 |
id |
2-s2.0-85192770804 |
---|---|
spelling |
2-s2.0-85192770804 Wibowo W.; Al Azies H.; Wilujeng S.A.; Abdul-Rahman S. Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia 2024 Lecture Notes on Data Engineering and Communications Technologies 191 10.1007/978-981-97-0293-0_24 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85192770804&doi=10.1007%2f978-981-97-0293-0_24&partnerID=40&md5=852c9d63aa80cc944e5893609885be99 This article aims to develop an accurate air quality prediction model to handle Jakarta's air pollution challenges. In this study, data from air quality monitoring stations’ conventional air pollution indexes was employed. In the research phase, data is explored, SMOTE is used to manage imbalances, and XGBoost is used to develop a model with the best parameters. The evaluation stage shows the model’s ability to predict air quality. With an accuracy rate of 99.516%, an F1-score of 99.528%, and a recall rate of 99.509%, the results were very astounding. These performance indicators show the model's exceptional ability to classify and predict air quality levels. Furthermore, this study investigates the significance of various variables in predicting air quality. A thorough evaluation of measures such as weight, gain, total gain, and cover indicators reveals the significance of numerous aspects. Even while SO2 helps predict air quality, the prevalence of PM2.5 on several measures reveals a significant influence. This study contributes to a better understanding of the complicated dynamics of air quality prediction by employing advanced analytical approaches and accurate models. This knowledge is useful in developing targeted solutions to address air pollution issues and promote healthier urban environments. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024. Springer Science and Business Media Deutschland GmbH 23674512 English Book chapter |
author |
Wibowo W.; Al Azies H.; Wilujeng S.A.; Abdul-Rahman S. |
spellingShingle |
Wibowo W.; Al Azies H.; Wilujeng S.A.; Abdul-Rahman S. Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia |
author_facet |
Wibowo W.; Al Azies H.; Wilujeng S.A.; Abdul-Rahman S. |
author_sort |
Wibowo W.; Al Azies H.; Wilujeng S.A.; Abdul-Rahman S. |
title |
Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia |
title_short |
Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia |
title_full |
Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia |
title_fullStr |
Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia |
title_full_unstemmed |
Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia |
title_sort |
Harnessing the XGBoost Ensemble for Intelligent Prediction and Identification of Factors with a High Impact on Air Quality: A Case Study of Urban Areas in Jakarta Province, Indonesia |
publishDate |
2024 |
container_title |
Lecture Notes on Data Engineering and Communications Technologies |
container_volume |
191 |
container_issue |
|
doi_str_mv |
10.1007/978-981-97-0293-0_24 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85192770804&doi=10.1007%2f978-981-97-0293-0_24&partnerID=40&md5=852c9d63aa80cc944e5893609885be99 |
description |
This article aims to develop an accurate air quality prediction model to handle Jakarta's air pollution challenges. In this study, data from air quality monitoring stations’ conventional air pollution indexes was employed. In the research phase, data is explored, SMOTE is used to manage imbalances, and XGBoost is used to develop a model with the best parameters. The evaluation stage shows the model’s ability to predict air quality. With an accuracy rate of 99.516%, an F1-score of 99.528%, and a recall rate of 99.509%, the results were very astounding. These performance indicators show the model's exceptional ability to classify and predict air quality levels. Furthermore, this study investigates the significance of various variables in predicting air quality. A thorough evaluation of measures such as weight, gain, total gain, and cover indicators reveals the significance of numerous aspects. Even while SO2 helps predict air quality, the prevalence of PM2.5 on several measures reveals a significant influence. This study contributes to a better understanding of the complicated dynamics of air quality prediction by employing advanced analytical approaches and accurate models. This knowledge is useful in developing targeted solutions to address air pollution issues and promote healthier urban environments. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024. |
publisher |
Springer Science and Business Media Deutschland GmbH |
issn |
23674512 |
language |
English |
format |
Book chapter |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1809678154913021952 |