A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution
In recent decades, air pollution has negatively affected human health and the environment. One of the important features contributing to air polluti on is called PM2.5. However, daily prediction of PM2.5 is still lacking, especially using feature selection infused into the model. Hence, the main obj...
Published in: | Environment and Natural Resources Journal |
---|---|
Main Author: | |
Format: | Article |
Language: | English |
Published: |
Faculty of Environment and Resource Studies,Mahidol University
2024
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85210087988&doi=10.32526%2fennrj%2f22%2f20240049&partnerID=40&md5=76e38040c93ada19fcb08bd0bdf4e3e6 |
id |
2-s2.0-85210087988 |
---|---|
spelling |
2-s2.0-85210087988 Arafin S.K.; Ul-Saufie A.Z.; Ghani N.A.M.; Ibrahim N. A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution 2024 Environment and Natural Resources Journal 22 6 10.32526/ennrj/22/20240049 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85210087988&doi=10.32526%2fennrj%2f22%2f20240049&partnerID=40&md5=76e38040c93ada19fcb08bd0bdf4e3e6 In recent decades, air pollution has negatively affected human health and the environment. One of the important features contributing to air polluti on is called PM2.5. However, daily prediction of PM2.5 is still lacking, especially using feature selection infused into the model. Hence, the main objective of this research is to utilize the feature selection procedures by proposing two stages feature selection methods namely adjusted correlation sharing t-test (adjcorT) and radial basis function neural network (RBFNN) in identifying the important features. This consequently also helps enhance the prediction of daily PM2.5 concentrations. Secondary data were obtained from the Department of Environment Malaysia (DOE) from 2018 until 2022 that consists of 5 years of air pollutant daily data. The results found that adjcorT-RBFNN identified the NO2, PM2.5, PM10, CO, O3, wind speed and SO2 as important features. The finding revealed that the accuracy, sensitivity, specificity, precision, F1 score and AUROC value, for a day-ahead prediction in Shah Alam are 0.756, 0.801, 0.717, 0.717, 0.757, and 0.758 respectively. Additionally, the predicted model may serve as an instrument for an early warning system, providing local authorities with information on air quality for formulation of strategies of air quality improvement. © 2024, Faculty of Environment and Resource Studies,Mahidol University. All rights reserved. Faculty of Environment and Resource Studies,Mahidol University 16865456 English Article All Open Access; Gold Open Access |
author |
Arafin S.K.; Ul-Saufie A.Z.; Ghani N.A.M.; Ibrahim N. |
spellingShingle |
Arafin S.K.; Ul-Saufie A.Z.; Ghani N.A.M.; Ibrahim N. A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution |
author_facet |
Arafin S.K.; Ul-Saufie A.Z.; Ghani N.A.M.; Ibrahim N. |
author_sort |
Arafin S.K.; Ul-Saufie A.Z.; Ghani N.A.M.; Ibrahim N. |
title |
A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution |
title_short |
A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution |
title_full |
A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution |
title_fullStr |
A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution |
title_full_unstemmed |
A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution |
title_sort |
A Two-Stage Feature Selection Method to Enhance Prediction of Daily PM2.5 Concentration Air Pollution |
publishDate |
2024 |
container_title |
Environment and Natural Resources Journal |
container_volume |
22 |
container_issue |
6 |
doi_str_mv |
10.32526/ennrj/22/20240049 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85210087988&doi=10.32526%2fennrj%2f22%2f20240049&partnerID=40&md5=76e38040c93ada19fcb08bd0bdf4e3e6 |
description |
In recent decades, air pollution has negatively affected human health and the environment. One of the important features contributing to air polluti on is called PM2.5. However, daily prediction of PM2.5 is still lacking, especially using feature selection infused into the model. Hence, the main objective of this research is to utilize the feature selection procedures by proposing two stages feature selection methods namely adjusted correlation sharing t-test (adjcorT) and radial basis function neural network (RBFNN) in identifying the important features. This consequently also helps enhance the prediction of daily PM2.5 concentrations. Secondary data were obtained from the Department of Environment Malaysia (DOE) from 2018 until 2022 that consists of 5 years of air pollutant daily data. The results found that adjcorT-RBFNN identified the NO2, PM2.5, PM10, CO, O3, wind speed and SO2 as important features. The finding revealed that the accuracy, sensitivity, specificity, precision, F1 score and AUROC value, for a day-ahead prediction in Shah Alam are 0.756, 0.801, 0.717, 0.717, 0.757, and 0.758 respectively. Additionally, the predicted model may serve as an instrument for an early warning system, providing local authorities with information on air quality for formulation of strategies of air quality improvement. © 2024, Faculty of Environment and Resource Studies,Mahidol University. All rights reserved. |
publisher |
Faculty of Environment and Resource Studies,Mahidol University |
issn |
16865456 |
language |
English |
format |
Article |
accesstype |
All Open Access; Gold Open Access |
record_format |
scopus |
collection |
Scopus |
_version_ |
1820775431636254720 |