Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter

Air pollution is a considerable health danger to the environment. The objective of this study was to assess the characteristics of air quality and predict PM10 concentrations using boosted regression trees (BRTs). The maximum daily PM10 concentration data from 2002 to 2016 were obtained from the air...

Full description

Bibliographic Details
Published in:International Journal of Environmental Science and Development
Main Author: Shaziayani W.N.; Ul-Saufie A.Z.; Yusoff S.A.M.; Ahmat H.; Libasin Z.
Format: Article
Language:English
Published: International Journal of Environmental Science and Development 2021
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85103188337&doi=10.18178%2fIJESD.2021.12.4.1329&partnerID=40&md5=26051f7753d2e4fb553f27c75b472b3b
Description
Summary:Air pollution is a considerable health danger to the environment. The objective of this study was to assess the characteristics of air quality and predict PM10 concentrations using boosted regression trees (BRTs). The maximum daily PM10 concentration data from 2002 to 2016 were obtained from the air quality monitoring station in Kuching, Sarawak. Eighty percent of the monitoring records were used for the training and twenty percent for the validation of the models. The best iteration of the BRT model was performed by optimizing the prediction performance, while the BRT algorithm model was constructed from multiple regression models. The two main parameters that were used were the learning rate (lr) and tree complexity (tc), which were fixed at 0.01 and 5, respectively. Meanwhile, the number of trees (nt) was determined by using an independent test set (test), a 5-fold cross validation (CV) and out-of-bag (OOB) estimation. The algorithm model for the BRT produced by using the CV was the best guide to be used compared with the OOB to test the predicted PM10 concentration. The performance indicators showed that the model was adequate for the next day’s prediction (PA=0.638, R2=0.427, IA=0.749, NAE=0.267, and RMSE=28.455). Copyright © 2021 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).
ISSN:20100264
DOI:10.18178/IJESD.2021.12.4.1329