Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models
Dissolved oxygen (DO) is an important indicator of river health for environmental engineers and ecological scientists to understand the state of river health. This study aims to evaluate the reliability of four feature selector algorithms i.e., Boruta, genetic algorithm (GA), multivariate adaptive r...
Published in: | Marine Pollution Bulletin |
---|---|
Main Author: | |
Format: | Article |
Language: | English |
Published: |
Elsevier Ltd
2021
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85110391473&doi=10.1016%2fj.marpolbul.2021.112639&partnerID=40&md5=2b2a4e0d46ffe7c396361a7c3a1caa2f |
id |
2-s2.0-85110391473 |
---|---|
spelling |
2-s2.0-85110391473 Tiyasha T.; Tung T.M.; Bhagat S.K.; Tan M.L.; Jawad A.H.; Mohtar W.H.M.W.; Yaseen Z.M. Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models 2021 Marine Pollution Bulletin 170 10.1016/j.marpolbul.2021.112639 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85110391473&doi=10.1016%2fj.marpolbul.2021.112639&partnerID=40&md5=2b2a4e0d46ffe7c396361a7c3a1caa2f Dissolved oxygen (DO) is an important indicator of river health for environmental engineers and ecological scientists to understand the state of river health. This study aims to evaluate the reliability of four feature selector algorithms i.e., Boruta, genetic algorithm (GA), multivariate adaptive regression splines (MARS), and extreme gradient boosting (XGBoost) to select the best suited predictor of the applied water quality (WQ) parameters; and compare four tree-based predictive models, namely, random forest (RF), conditional random forests (cForest), RANdom forest GEneRator (Ranger), and XGBoost to predict the changes of dissolved oxygen (DO) in the Klang River, Malaysia. The total features including 15 WQ parameters from monitoring site data and 7 hydrological components from remote sensing data. All predictive models performed well as per the features selected by the algorithms XGBoost and MARS in terms applied statistical evaluators. Besides, the best performance noted in case of XGBoost predictive model among all applied predictive models when the feature selected by MARS and XGBoost algorithms, with the coefficient of determination (R2) values of 0.84 and 0.85, respectively, nonetheless the marginal performance came up by Boruta-XGBoost model on in this scenario. © 2021 Elsevier Ltd Elsevier Ltd 0025326X English Article All Open Access; Bronze Open Access |
author |
Tiyasha T.; Tung T.M.; Bhagat S.K.; Tan M.L.; Jawad A.H.; Mohtar W.H.M.W.; Yaseen Z.M. |
spellingShingle |
Tiyasha T.; Tung T.M.; Bhagat S.K.; Tan M.L.; Jawad A.H.; Mohtar W.H.M.W.; Yaseen Z.M. Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models |
author_facet |
Tiyasha T.; Tung T.M.; Bhagat S.K.; Tan M.L.; Jawad A.H.; Mohtar W.H.M.W.; Yaseen Z.M. |
author_sort |
Tiyasha T.; Tung T.M.; Bhagat S.K.; Tan M.L.; Jawad A.H.; Mohtar W.H.M.W.; Yaseen Z.M. |
title |
Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models |
title_short |
Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models |
title_full |
Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models |
title_fullStr |
Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models |
title_full_unstemmed |
Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models |
title_sort |
Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models |
publishDate |
2021 |
container_title |
Marine Pollution Bulletin |
container_volume |
170 |
container_issue |
|
doi_str_mv |
10.1016/j.marpolbul.2021.112639 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85110391473&doi=10.1016%2fj.marpolbul.2021.112639&partnerID=40&md5=2b2a4e0d46ffe7c396361a7c3a1caa2f |
description |
Dissolved oxygen (DO) is an important indicator of river health for environmental engineers and ecological scientists to understand the state of river health. This study aims to evaluate the reliability of four feature selector algorithms i.e., Boruta, genetic algorithm (GA), multivariate adaptive regression splines (MARS), and extreme gradient boosting (XGBoost) to select the best suited predictor of the applied water quality (WQ) parameters; and compare four tree-based predictive models, namely, random forest (RF), conditional random forests (cForest), RANdom forest GEneRator (Ranger), and XGBoost to predict the changes of dissolved oxygen (DO) in the Klang River, Malaysia. The total features including 15 WQ parameters from monitoring site data and 7 hydrological components from remote sensing data. All predictive models performed well as per the features selected by the algorithms XGBoost and MARS in terms applied statistical evaluators. Besides, the best performance noted in case of XGBoost predictive model among all applied predictive models when the feature selected by MARS and XGBoost algorithms, with the coefficient of determination (R2) values of 0.84 and 0.85, respectively, nonetheless the marginal performance came up by Boruta-XGBoost model on in this scenario. © 2021 Elsevier Ltd |
publisher |
Elsevier Ltd |
issn |
0025326X |
language |
English |
format |
Article |
accesstype |
All Open Access; Bronze Open Access |
record_format |
scopus |
collection |
Scopus |
_version_ |
1809678158868250624 |