Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter
Air pollution is a considerable health danger to the environment. The objective of this study was to assess the characteristics of air quality and predict PM10 concentrations using boosted regression trees (BRTs). The maximum daily PM10 concentration data from 2002 to 2016 were obtained from the air...
Published in: | International Journal of Environmental Science and Development |
---|---|
Main Author: | |
Format: | Article |
Language: | English |
Published: |
International Journal of Environmental Science and Development
2021
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85103188337&doi=10.18178%2fIJESD.2021.12.4.1329&partnerID=40&md5=26051f7753d2e4fb553f27c75b472b3b |
id |
2-s2.0-85103188337 |
---|---|
spelling |
2-s2.0-85103188337 Shaziayani W.N.; Ul-Saufie A.Z.; Yusoff S.A.M.; Ahmat H.; Libasin Z. Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter 2021 International Journal of Environmental Science and Development 12 4 10.18178/IJESD.2021.12.4.1329 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85103188337&doi=10.18178%2fIJESD.2021.12.4.1329&partnerID=40&md5=26051f7753d2e4fb553f27c75b472b3b Air pollution is a considerable health danger to the environment. The objective of this study was to assess the characteristics of air quality and predict PM10 concentrations using boosted regression trees (BRTs). The maximum daily PM10 concentration data from 2002 to 2016 were obtained from the air quality monitoring station in Kuching, Sarawak. Eighty percent of the monitoring records were used for the training and twenty percent for the validation of the models. The best iteration of the BRT model was performed by optimizing the prediction performance, while the BRT algorithm model was constructed from multiple regression models. The two main parameters that were used were the learning rate (lr) and tree complexity (tc), which were fixed at 0.01 and 5, respectively. Meanwhile, the number of trees (nt) was determined by using an independent test set (test), a 5-fold cross validation (CV) and out-of-bag (OOB) estimation. The algorithm model for the BRT produced by using the CV was the best guide to be used compared with the OOB to test the predicted PM10 concentration. The performance indicators showed that the model was adequate for the next day’s prediction (PA=0.638, R2=0.427, IA=0.749, NAE=0.267, and RMSE=28.455). Copyright © 2021 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0). International Journal of Environmental Science and Development 20100264 English Article All Open Access; Gold Open Access |
author |
Shaziayani W.N.; Ul-Saufie A.Z.; Yusoff S.A.M.; Ahmat H.; Libasin Z. |
spellingShingle |
Shaziayani W.N.; Ul-Saufie A.Z.; Yusoff S.A.M.; Ahmat H.; Libasin Z. Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter |
author_facet |
Shaziayani W.N.; Ul-Saufie A.Z.; Yusoff S.A.M.; Ahmat H.; Libasin Z. |
author_sort |
Shaziayani W.N.; Ul-Saufie A.Z.; Yusoff S.A.M.; Ahmat H.; Libasin Z. |
title |
Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter |
title_short |
Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter |
title_full |
Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter |
title_fullStr |
Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter |
title_full_unstemmed |
Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter |
title_sort |
Evaluation of boosted regression tree for the prediction of the maximum 24-hour concentration of particulate matter |
publishDate |
2021 |
container_title |
International Journal of Environmental Science and Development |
container_volume |
12 |
container_issue |
4 |
doi_str_mv |
10.18178/IJESD.2021.12.4.1329 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85103188337&doi=10.18178%2fIJESD.2021.12.4.1329&partnerID=40&md5=26051f7753d2e4fb553f27c75b472b3b |
description |
Air pollution is a considerable health danger to the environment. The objective of this study was to assess the characteristics of air quality and predict PM10 concentrations using boosted regression trees (BRTs). The maximum daily PM10 concentration data from 2002 to 2016 were obtained from the air quality monitoring station in Kuching, Sarawak. Eighty percent of the monitoring records were used for the training and twenty percent for the validation of the models. The best iteration of the BRT model was performed by optimizing the prediction performance, while the BRT algorithm model was constructed from multiple regression models. The two main parameters that were used were the learning rate (lr) and tree complexity (tc), which were fixed at 0.01 and 5, respectively. Meanwhile, the number of trees (nt) was determined by using an independent test set (test), a 5-fold cross validation (CV) and out-of-bag (OOB) estimation. The algorithm model for the BRT produced by using the CV was the best guide to be used compared with the OOB to test the predicted PM10 concentration. The performance indicators showed that the model was adequate for the next day’s prediction (PA=0.638, R2=0.427, IA=0.749, NAE=0.267, and RMSE=28.455). Copyright © 2021 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0). |
publisher |
International Journal of Environmental Science and Development |
issn |
20100264 |
language |
English |
format |
Article |
accesstype |
All Open Access; Gold Open Access |
record_format |
scopus |
collection |
Scopus |
_version_ |
1809678028113969152 |