Feature Selection Methods Using RBFNN Based on Enhance Air Quality Prediction: Insights from Shah Alam

This study examines the predictive efficiency of several feature selection approaches in air quality models aimed to predict next-day PM2.5 concentrations in Shah Alam, Malaysia. Air pollution in urban areas is a significant public health concern, and accurate prediction models are essential for tim...

全面介紹

書目詳細資料
發表在:International Journal of Advanced Computer Science and Applications
主要作者: 2-s2.0-86000672793
格式: Article
語言:English
出版: Science and Information Organization 2024
在線閱讀:https://www.scopus.com/inward/record.uri?eid=2-s2.0-86000672793&doi=10.14569%2fIJACSA.2024.0151148&partnerID=40&md5=ac387223414c643c39b33d00b142c970
實物特徵
總結:This study examines the predictive efficiency of several feature selection approaches in air quality models aimed to predict next-day PM2.5 concentrations in Shah Alam, Malaysia. Air pollution in urban areas is a significant public health concern, and accurate prediction models are essential for timely interventions. However, determining the most important parameters to include in these models remains difficult, especially in complex urban areas with several pollution sources. To address this, we employed three different feature selection methods and applied them to a dataset comprising 43,824 air quality data points provided by the Department of Environmental Malaysia. The data set contained ten variables, such as gas pollutants and meteorological indicators. Each feature selection approach determined top eight variables to include in a Radial Basis Function Neural Network (RBFNN) model. The results showed that ReliefF outperformed Lasso and mRMR in terms of accuracy, specificity, precision, F1 Score, and AUROC, making it the most effective feature selection method for this study. This study contributes to the body of knowledge on air quality modelling by emphasising the relevance of using proper feature selection techniques that are suited to the specific characteristics of the dataset and urban area. Furthermore, it proposes that future study should look into the use of ReliefF-RBFNN in other settings, such as suburban and rural areas, as well as hybrid feature selection approaches to improve prediction performance across several context. © (2024), (Science and Information Organization). All Rights Reserved.
ISSN:2158107X
DOI:10.14569/IJACSA.2024.0151148