Feature Selection Methods Using RBFNN Based on Enhance Air Quality Prediction: Insights from Shah Alam

This study examines the predictive efficiency of several feature selection approaches in air quality models aimed to predict next-day PM2.5 concentrations in Shah Alam, Malaysia. Air pollution in urban areas is a significant public health concern, and accurate prediction models are essential for tim...

全面介绍

书目详细资料
发表在:International Journal of Advanced Computer Science and Applications
主要作者: 2-s2.0-86000672793
格式: 文件
语言:English
出版: Science and Information Organization 2024
在线阅读:https://www.scopus.com/inward/record.uri?eid=2-s2.0-86000672793&doi=10.14569%2fIJACSA.2024.0151148&partnerID=40&md5=ac387223414c643c39b33d00b142c970
id Arafin S.K.; Ul-Saufie A.Z.; Ghani N.A.M.; Ibrahim N.
spelling Arafin S.K.; Ul-Saufie A.Z.; Ghani N.A.M.; Ibrahim N.
2-s2.0-86000672793
Feature Selection Methods Using RBFNN Based on Enhance Air Quality Prediction: Insights from Shah Alam
2024
International Journal of Advanced Computer Science and Applications
15
11
10.14569/IJACSA.2024.0151148
https://www.scopus.com/inward/record.uri?eid=2-s2.0-86000672793&doi=10.14569%2fIJACSA.2024.0151148&partnerID=40&md5=ac387223414c643c39b33d00b142c970
This study examines the predictive efficiency of several feature selection approaches in air quality models aimed to predict next-day PM2.5 concentrations in Shah Alam, Malaysia. Air pollution in urban areas is a significant public health concern, and accurate prediction models are essential for timely interventions. However, determining the most important parameters to include in these models remains difficult, especially in complex urban areas with several pollution sources. To address this, we employed three different feature selection methods and applied them to a dataset comprising 43,824 air quality data points provided by the Department of Environmental Malaysia. The data set contained ten variables, such as gas pollutants and meteorological indicators. Each feature selection approach determined top eight variables to include in a Radial Basis Function Neural Network (RBFNN) model. The results showed that ReliefF outperformed Lasso and mRMR in terms of accuracy, specificity, precision, F1 Score, and AUROC, making it the most effective feature selection method for this study. This study contributes to the body of knowledge on air quality modelling by emphasising the relevance of using proper feature selection techniques that are suited to the specific characteristics of the dataset and urban area. Furthermore, it proposes that future study should look into the use of ReliefF-RBFNN in other settings, such as suburban and rural areas, as well as hybrid feature selection approaches to improve prediction performance across several context. © (2024), (Science and Information Organization). All Rights Reserved.
Science and Information Organization
2158107X
English
Article
All Open Access; Gold Open Access
author 2-s2.0-86000672793
spellingShingle 2-s2.0-86000672793
Feature Selection Methods Using RBFNN Based on Enhance Air Quality Prediction: Insights from Shah Alam
author_facet 2-s2.0-86000672793
author_sort 2-s2.0-86000672793
title Feature Selection Methods Using RBFNN Based on Enhance Air Quality Prediction: Insights from Shah Alam
title_short Feature Selection Methods Using RBFNN Based on Enhance Air Quality Prediction: Insights from Shah Alam
title_full Feature Selection Methods Using RBFNN Based on Enhance Air Quality Prediction: Insights from Shah Alam
title_fullStr Feature Selection Methods Using RBFNN Based on Enhance Air Quality Prediction: Insights from Shah Alam
title_full_unstemmed Feature Selection Methods Using RBFNN Based on Enhance Air Quality Prediction: Insights from Shah Alam
title_sort Feature Selection Methods Using RBFNN Based on Enhance Air Quality Prediction: Insights from Shah Alam
publishDate 2024
container_title International Journal of Advanced Computer Science and Applications
container_volume 15
container_issue 11
doi_str_mv 10.14569/IJACSA.2024.0151148
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-86000672793&doi=10.14569%2fIJACSA.2024.0151148&partnerID=40&md5=ac387223414c643c39b33d00b142c970
description This study examines the predictive efficiency of several feature selection approaches in air quality models aimed to predict next-day PM2.5 concentrations in Shah Alam, Malaysia. Air pollution in urban areas is a significant public health concern, and accurate prediction models are essential for timely interventions. However, determining the most important parameters to include in these models remains difficult, especially in complex urban areas with several pollution sources. To address this, we employed three different feature selection methods and applied them to a dataset comprising 43,824 air quality data points provided by the Department of Environmental Malaysia. The data set contained ten variables, such as gas pollutants and meteorological indicators. Each feature selection approach determined top eight variables to include in a Radial Basis Function Neural Network (RBFNN) model. The results showed that ReliefF outperformed Lasso and mRMR in terms of accuracy, specificity, precision, F1 Score, and AUROC, making it the most effective feature selection method for this study. This study contributes to the body of knowledge on air quality modelling by emphasising the relevance of using proper feature selection techniques that are suited to the specific characteristics of the dataset and urban area. Furthermore, it proposes that future study should look into the use of ReliefF-RBFNN in other settings, such as suburban and rural areas, as well as hybrid feature selection approaches to improve prediction performance across several context. © (2024), (Science and Information Organization). All Rights Reserved.
publisher Science and Information Organization
issn 2158107X
language English
format Article
accesstype All Open Access; Gold Open Access
record_format scopus
collection Scopus
_version_ 1828987861175631872