Predicting Ocean Current Temperature Off the East Coast of America with XGBoost and Random Forest Algorithms Using Rstudio

This research investigates the comparative predictive efficacy of two leading machine learning methodologies, specifically the XGBoost and Random Forest models, in estimating ocean temperature dynamics in the TS Gulf Stream and Labrador Current regions along the east coast of North America. Using an...

Full description

Bibliographic Details
Published in:Ilmu Kelautan: Indonesian Journal of Marine Sciences
Main Authors: Alfaris L.; Firdaus A.N.; Nyuswantoro U.I.; Siagian R.C.; Muhammad A.C.; Hassan R.; Aunzo R.T., Jr.; Ariefka R.
Format: Article
Language:English
Published: Diponegoro University 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85198048471&doi=10.14710%2fik.ijms.29.2.273-284&partnerID=40&md5=4b50e7e26a84ca531d26655f43c54794
id 2-s2.0-85198048471
spelling 2-s2.0-85198048471
Alfaris L.; Firdaus A.N.; Nyuswantoro U.I.; Siagian R.C.; Muhammad A.C.; Hassan R.; Aunzo R.T., Jr.; Ariefka R.
Predicting Ocean Current Temperature Off the East Coast of America with XGBoost and Random Forest Algorithms Using Rstudio
2024
Ilmu Kelautan: Indonesian Journal of Marine Sciences
29
2
10.14710/ik.ijms.29.2.273-284
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85198048471&doi=10.14710%2fik.ijms.29.2.273-284&partnerID=40&md5=4b50e7e26a84ca531d26655f43c54794
This research investigates the comparative predictive efficacy of two leading machine learning methodologies, specifically the XGBoost and Random Forest models, in estimating ocean temperature dynamics in the TS Gulf Stream and Labrador Current regions along the east coast of North America. Using annual temperature datasets and relevant oceanographic parameters, the data is carefully processed, cleaned and sorted into training and test subsets via the RStudio Platform. The performance evaluation model is carried out using predetermined machine learning assessment criteria, including Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), Mean Squared Error (MSE), and R-squared. The results show the superiority of the XGBoost model compared to Random Forest in terms of prediction accuracy and minimizing prediction errors. The XGBoost model shows lower MSE values and higher R-squared values than the Random Forest model, indicating its better capacity in explaining data variations. XGBoost consistently provides more accurate predictions and shows higher sensitivity in identifying important factors influencing ocean temperature fluctuations than Random Forest. This research significantly improves understanding and prognostic capabilities regarding ocean temperature dynamics in the TS Gulf Stream and Labrador Current regions. Empirical evidence underlines the efficacy of the XGBoost model in predicting ocean temperatures in the studied region. Continuous model evaluation and parameter refinement for both methodologies is critical to establishing standards for optimal prediction performance. The findings of this research have implications for the fields of oceanography and climate science, and offer potential pathways to comprehensively understand and mitigate the impacts of climate change on marine ecosystems. © Ilmu Kelautan, UNDIP.
Diponegoro University
8537291
English
Article
All Open Access; Gold Open Access
author Alfaris L.; Firdaus A.N.; Nyuswantoro U.I.; Siagian R.C.; Muhammad A.C.; Hassan R.; Aunzo R.T.
Jr.; Ariefka R.
spellingShingle Alfaris L.; Firdaus A.N.; Nyuswantoro U.I.; Siagian R.C.; Muhammad A.C.; Hassan R.; Aunzo R.T.
Jr.; Ariefka R.
Predicting Ocean Current Temperature Off the East Coast of America with XGBoost and Random Forest Algorithms Using Rstudio
author_facet Alfaris L.; Firdaus A.N.; Nyuswantoro U.I.; Siagian R.C.; Muhammad A.C.; Hassan R.; Aunzo R.T.
Jr.; Ariefka R.
author_sort Alfaris L.; Firdaus A.N.; Nyuswantoro U.I.; Siagian R.C.; Muhammad A.C.; Hassan R.; Aunzo R.T.
title Predicting Ocean Current Temperature Off the East Coast of America with XGBoost and Random Forest Algorithms Using Rstudio
title_short Predicting Ocean Current Temperature Off the East Coast of America with XGBoost and Random Forest Algorithms Using Rstudio
title_full Predicting Ocean Current Temperature Off the East Coast of America with XGBoost and Random Forest Algorithms Using Rstudio
title_fullStr Predicting Ocean Current Temperature Off the East Coast of America with XGBoost and Random Forest Algorithms Using Rstudio
title_full_unstemmed Predicting Ocean Current Temperature Off the East Coast of America with XGBoost and Random Forest Algorithms Using Rstudio
title_sort Predicting Ocean Current Temperature Off the East Coast of America with XGBoost and Random Forest Algorithms Using Rstudio
publishDate 2024
container_title Ilmu Kelautan: Indonesian Journal of Marine Sciences
container_volume 29
container_issue 2
doi_str_mv 10.14710/ik.ijms.29.2.273-284
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85198048471&doi=10.14710%2fik.ijms.29.2.273-284&partnerID=40&md5=4b50e7e26a84ca531d26655f43c54794
description This research investigates the comparative predictive efficacy of two leading machine learning methodologies, specifically the XGBoost and Random Forest models, in estimating ocean temperature dynamics in the TS Gulf Stream and Labrador Current regions along the east coast of North America. Using annual temperature datasets and relevant oceanographic parameters, the data is carefully processed, cleaned and sorted into training and test subsets via the RStudio Platform. The performance evaluation model is carried out using predetermined machine learning assessment criteria, including Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), Mean Squared Error (MSE), and R-squared. The results show the superiority of the XGBoost model compared to Random Forest in terms of prediction accuracy and minimizing prediction errors. The XGBoost model shows lower MSE values and higher R-squared values than the Random Forest model, indicating its better capacity in explaining data variations. XGBoost consistently provides more accurate predictions and shows higher sensitivity in identifying important factors influencing ocean temperature fluctuations than Random Forest. This research significantly improves understanding and prognostic capabilities regarding ocean temperature dynamics in the TS Gulf Stream and Labrador Current regions. Empirical evidence underlines the efficacy of the XGBoost model in predicting ocean temperatures in the studied region. Continuous model evaluation and parameter refinement for both methodologies is critical to establishing standards for optimal prediction performance. The findings of this research have implications for the fields of oceanography and climate science, and offer potential pathways to comprehensively understand and mitigate the impacts of climate change on marine ecosystems. © Ilmu Kelautan, UNDIP.
publisher Diponegoro University
issn 8537291
language English
format Article
accesstype All Open Access; Gold Open Access
record_format scopus
collection Scopus
_version_ 1818940556267487232