Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter

Air pollution is a considerable health danger to the environment. The objective of this study was to assess the characteristics of air quality and predict PM10 concentrations using boosted regression trees (BRTs). The maximum daily PM10 concentration data from 2002 to 2016 were obtained from the air...

Full description

Bibliographic Details
Published in:International Journal of Environmental Science and Development
Main Author: Shaziayani W.N.; Ul-Saufie A.Z.; Yusoff S.A.M.; Ahmat H.; Libasin Z.
Format: Article
Language:English
Published: International Journal of Environmental Science and Development 2021
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85103188337&doi=10.18178%2fIJESD.2021.12.4.1329&partnerID=40&md5=26051f7753d2e4fb553f27c75b472b3b
id 2-s2.0-85103188337
spelling 2-s2.0-85103188337
Shaziayani W.N.; Ul-Saufie A.Z.; Yusoff S.A.M.; Ahmat H.; Libasin Z.
Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter
2021
International Journal of Environmental Science and Development
12
4
10.18178/IJESD.2021.12.4.1329
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85103188337&doi=10.18178%2fIJESD.2021.12.4.1329&partnerID=40&md5=26051f7753d2e4fb553f27c75b472b3b
Air pollution is a considerable health danger to the environment. The objective of this study was to assess the characteristics of air quality and predict PM10 concentrations using boosted regression trees (BRTs). The maximum daily PM10 concentration data from 2002 to 2016 were obtained from the air quality monitoring station in Kuching, Sarawak. Eighty percent of the monitoring records were used for the training and twenty percent for the validation of the models. The best iteration of the BRT model was performed by optimizing the prediction performance, while the BRT algorithm model was constructed from multiple regression models. The two main parameters that were used were the learning rate (lr) and tree complexity (tc), which were fixed at 0.01 and 5, respectively. Meanwhile, the number of trees (nt) was determined by using an independent test set (test), a 5-fold cross validation (CV) and out-of-bag (OOB) estimation. The algorithm model for the BRT produced by using the CV was the best guide to be used compared with the OOB to test the predicted PM10 concentration. The performance indicators showed that the model was adequate for the next day’s prediction (PA=0.638, R2=0.427, IA=0.749, NAE=0.267, and RMSE=28.455). Copyright © 2021 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).
International Journal of Environmental Science and Development
20100264
English
Article
All Open Access; Gold Open Access
author Shaziayani W.N.; Ul-Saufie A.Z.; Yusoff S.A.M.; Ahmat H.; Libasin Z.
spellingShingle Shaziayani W.N.; Ul-Saufie A.Z.; Yusoff S.A.M.; Ahmat H.; Libasin Z.
Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter
author_facet Shaziayani W.N.; Ul-Saufie A.Z.; Yusoff S.A.M.; Ahmat H.; Libasin Z.
author_sort Shaziayani W.N.; Ul-Saufie A.Z.; Yusoff S.A.M.; Ahmat H.; Libasin Z.
title Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter
title_short Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter
title_full Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter
title_fullStr Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter
title_full_unstemmed Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter
title_sort Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter
publishDate 2021
container_title International Journal of Environmental Science and Development
container_volume 12
container_issue 4
doi_str_mv 10.18178/IJESD.2021.12.4.1329
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85103188337&doi=10.18178%2fIJESD.2021.12.4.1329&partnerID=40&md5=26051f7753d2e4fb553f27c75b472b3b
description Air pollution is a considerable health danger to the environment. The objective of this study was to assess the characteristics of air quality and predict PM10 concentrations using boosted regression trees (BRTs). The maximum daily PM10 concentration data from 2002 to 2016 were obtained from the air quality monitoring station in Kuching, Sarawak. Eighty percent of the monitoring records were used for the training and twenty percent for the validation of the models. The best iteration of the BRT model was performed by optimizing the prediction performance, while the BRT algorithm model was constructed from multiple regression models. The two main parameters that were used were the learning rate (lr) and tree complexity (tc), which were fixed at 0.01 and 5, respectively. Meanwhile, the number of trees (nt) was determined by using an independent test set (test), a 5-fold cross validation (CV) and out-of-bag (OOB) estimation. The algorithm model for the BRT produced by using the CV was the best guide to be used compared with the OOB to test the predicted PM10 concentration. The performance indicators showed that the model was adequate for the next day’s prediction (PA=0.638, R2=0.427, IA=0.749, NAE=0.267, and RMSE=28.455). Copyright © 2021 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).
publisher International Journal of Environmental Science and Development
issn 20100264
language English
format Article
accesstype All Open Access; Gold Open Access
record_format scopus
collection Scopus
_version_ 1809678028113969152