A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution

In recent decades, air pollution has negatively affected human health and the environment. One of the important features contributing to air polluti on is called PM2.5. However, daily prediction of PM2.5 is still lacking, especially using feature selection infused into the model. Hence, the main obj...

Full description

Bibliographic Details
Published in:Environment and Natural Resources Journal
Main Author: Arafin S.K.; Ul-Saufie A.Z.; Ghani N.A.M.; Ibrahim N.
Format: Article
Language:English
Published: Faculty of Environment and Resource Studies,Mahidol University 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85210087988&doi=10.32526%2fennrj%2f22%2f20240049&partnerID=40&md5=76e38040c93ada19fcb08bd0bdf4e3e6
id 2-s2.0-85210087988
spelling 2-s2.0-85210087988
Arafin S.K.; Ul-Saufie A.Z.; Ghani N.A.M.; Ibrahim N.
A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution
2024
Environment and Natural Resources Journal
22
6
10.32526/ennrj/22/20240049
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85210087988&doi=10.32526%2fennrj%2f22%2f20240049&partnerID=40&md5=76e38040c93ada19fcb08bd0bdf4e3e6
In recent decades, air pollution has negatively affected human health and the environment. One of the important features contributing to air polluti on is called PM2.5. However, daily prediction of PM2.5 is still lacking, especially using feature selection infused into the model. Hence, the main objective of this research is to utilize the feature selection procedures by proposing two stages feature selection methods namely adjusted correlation sharing t-test (adjcorT) and radial basis function neural network (RBFNN) in identifying the important features. This consequently also helps enhance the prediction of daily PM2.5 concentrations. Secondary data were obtained from the Department of Environment Malaysia (DOE) from 2018 until 2022 that consists of 5 years of air pollutant daily data. The results found that adjcorT-RBFNN identified the NO2, PM2.5, PM10, CO, O3, wind speed and SO2 as important features. The finding revealed that the accuracy, sensitivity, specificity, precision, F1 score and AUROC value, for a day-ahead prediction in Shah Alam are 0.756, 0.801, 0.717, 0.717, 0.757, and 0.758 respectively. Additionally, the predicted model may serve as an instrument for an early warning system, providing local authorities with information on air quality for formulation of strategies of air quality improvement. © 2024, Faculty of Environment and Resource Studies,Mahidol University. All rights reserved.
Faculty of Environment and Resource Studies,Mahidol University
16865456
English
Article
All Open Access; Gold Open Access
author Arafin S.K.; Ul-Saufie A.Z.; Ghani N.A.M.; Ibrahim N.
spellingShingle Arafin S.K.; Ul-Saufie A.Z.; Ghani N.A.M.; Ibrahim N.
A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution
author_facet Arafin S.K.; Ul-Saufie A.Z.; Ghani N.A.M.; Ibrahim N.
author_sort Arafin S.K.; Ul-Saufie A.Z.; Ghani N.A.M.; Ibrahim N.
title A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution
title_short A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution
title_full A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution
title_fullStr A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution
title_full_unstemmed A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution
title_sort A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution
publishDate 2024
container_title Environment and Natural Resources Journal
container_volume 22
container_issue 6
doi_str_mv 10.32526/ennrj/22/20240049
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85210087988&doi=10.32526%2fennrj%2f22%2f20240049&partnerID=40&md5=76e38040c93ada19fcb08bd0bdf4e3e6
description In recent decades, air pollution has negatively affected human health and the environment. One of the important features contributing to air polluti on is called PM2.5. However, daily prediction of PM2.5 is still lacking, especially using feature selection infused into the model. Hence, the main objective of this research is to utilize the feature selection procedures by proposing two stages feature selection methods namely adjusted correlation sharing t-test (adjcorT) and radial basis function neural network (RBFNN) in identifying the important features. This consequently also helps enhance the prediction of daily PM2.5 concentrations. Secondary data were obtained from the Department of Environment Malaysia (DOE) from 2018 until 2022 that consists of 5 years of air pollutant daily data. The results found that adjcorT-RBFNN identified the NO2, PM2.5, PM10, CO, O3, wind speed and SO2 as important features. The finding revealed that the accuracy, sensitivity, specificity, precision, F1 score and AUROC value, for a day-ahead prediction in Shah Alam are 0.756, 0.801, 0.717, 0.717, 0.757, and 0.758 respectively. Additionally, the predicted model may serve as an instrument for an early warning system, providing local authorities with information on air quality for formulation of strategies of air quality improvement. © 2024, Faculty of Environment and Resource Studies,Mahidol University. All rights reserved.
publisher Faculty of Environment and Resource Studies,Mahidol University
issn 16865456
language English
format Article
accesstype All Open Access; Gold Open Access
record_format scopus
collection Scopus
_version_ 1820775431636254720