Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models

Dissolved oxygen (DO) is an important indicator of river health for environmental engineers and ecological scientists to understand the state of river health. This study aims to evaluate the reliability of four feature selector algorithms i.e., Boruta, genetic algorithm (GA), multivariate adaptive r...

Full description

Bibliographic Details
Published in:Marine Pollution Bulletin
Main Author: Tiyasha T.; Tung T.M.; Bhagat S.K.; Tan M.L.; Jawad A.H.; Mohtar W.H.M.W.; Yaseen Z.M.
Format: Article
Language:English
Published: Elsevier Ltd 2021
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85110391473&doi=10.1016%2fj.marpolbul.2021.112639&partnerID=40&md5=2b2a4e0d46ffe7c396361a7c3a1caa2f
id 2-s2.0-85110391473
spelling 2-s2.0-85110391473
Tiyasha T.; Tung T.M.; Bhagat S.K.; Tan M.L.; Jawad A.H.; Mohtar W.H.M.W.; Yaseen Z.M.
Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models
2021
Marine Pollution Bulletin
170

10.1016/j.marpolbul.2021.112639
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85110391473&doi=10.1016%2fj.marpolbul.2021.112639&partnerID=40&md5=2b2a4e0d46ffe7c396361a7c3a1caa2f
Dissolved oxygen (DO) is an important indicator of river health for environmental engineers and ecological scientists to understand the state of river health. This study aims to evaluate the reliability of four feature selector algorithms i.e., Boruta, genetic algorithm (GA), multivariate adaptive regression splines (MARS), and extreme gradient boosting (XGBoost) to select the best suited predictor of the applied water quality (WQ) parameters; and compare four tree-based predictive models, namely, random forest (RF), conditional random forests (cForest), RANdom forest GEneRator (Ranger), and XGBoost to predict the changes of dissolved oxygen (DO) in the Klang River, Malaysia. The total features including 15 WQ parameters from monitoring site data and 7 hydrological components from remote sensing data. All predictive models performed well as per the features selected by the algorithms XGBoost and MARS in terms applied statistical evaluators. Besides, the best performance noted in case of XGBoost predictive model among all applied predictive models when the feature selected by MARS and XGBoost algorithms, with the coefficient of determination (R2) values of 0.84 and 0.85, respectively, nonetheless the marginal performance came up by Boruta-XGBoost model on in this scenario. © 2021 Elsevier Ltd
Elsevier Ltd
0025326X
English
Article
All Open Access; Bronze Open Access
author Tiyasha T.; Tung T.M.; Bhagat S.K.; Tan M.L.; Jawad A.H.; Mohtar W.H.M.W.; Yaseen Z.M.
spellingShingle Tiyasha T.; Tung T.M.; Bhagat S.K.; Tan M.L.; Jawad A.H.; Mohtar W.H.M.W.; Yaseen Z.M.
Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models
author_facet Tiyasha T.; Tung T.M.; Bhagat S.K.; Tan M.L.; Jawad A.H.; Mohtar W.H.M.W.; Yaseen Z.M.
author_sort Tiyasha T.; Tung T.M.; Bhagat S.K.; Tan M.L.; Jawad A.H.; Mohtar W.H.M.W.; Yaseen Z.M.
title Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models
title_short Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models
title_full Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models
title_fullStr Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models
title_full_unstemmed Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models
title_sort Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models
publishDate 2021
container_title Marine Pollution Bulletin
container_volume 170
container_issue
doi_str_mv 10.1016/j.marpolbul.2021.112639
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85110391473&doi=10.1016%2fj.marpolbul.2021.112639&partnerID=40&md5=2b2a4e0d46ffe7c396361a7c3a1caa2f
description Dissolved oxygen (DO) is an important indicator of river health for environmental engineers and ecological scientists to understand the state of river health. This study aims to evaluate the reliability of four feature selector algorithms i.e., Boruta, genetic algorithm (GA), multivariate adaptive regression splines (MARS), and extreme gradient boosting (XGBoost) to select the best suited predictor of the applied water quality (WQ) parameters; and compare four tree-based predictive models, namely, random forest (RF), conditional random forests (cForest), RANdom forest GEneRator (Ranger), and XGBoost to predict the changes of dissolved oxygen (DO) in the Klang River, Malaysia. The total features including 15 WQ parameters from monitoring site data and 7 hydrological components from remote sensing data. All predictive models performed well as per the features selected by the algorithms XGBoost and MARS in terms applied statistical evaluators. Besides, the best performance noted in case of XGBoost predictive model among all applied predictive models when the feature selected by MARS and XGBoost algorithms, with the coefficient of determination (R2) values of 0.84 and 0.85, respectively, nonetheless the marginal performance came up by Boruta-XGBoost model on in this scenario. © 2021 Elsevier Ltd
publisher Elsevier Ltd
issn 0025326X
language English
format Article
accesstype All Open Access; Bronze Open Access
record_format scopus
collection Scopus
_version_ 1809678158868250624