A Comparative Study of Univariate and Multivariate Time Series Forecasting for CPO Prices Using Machine Learning Techniques

The Malaysian palm oil sector has significantly contributed to developing the domestic economy and the global palm oil market. However, the fluctuation in Crude Palm Oil (CPO) prices poses a significant risk to farmers, producers, traders, consumers, and others involved in CPO production and marketi...

Full description

Bibliographic Details
Published in:Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Main Author: Mohd Fuad J.N.F.D.; Ibrahim Z.; Adam N.L.; Mat Diah N.
Format: Conference paper
Language:English
Published: Springer Science and Business Media Deutschland GmbH 2024
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85175952373&doi=10.1007%2f978-981-99-7339-2_5&partnerID=40&md5=385e03bb59ac0fb2004b6ee6195214af
Description
Summary:The Malaysian palm oil sector has significantly contributed to developing the domestic economy and the global palm oil market. However, the fluctuation in Crude Palm Oil (CPO) prices poses a significant risk to farmers, producers, traders, consumers, and others involved in CPO production and marketing. An accurate CPO price forecasting technique is required to aid decision-making in risky and unpredictable scenarios. Hence, this project aims to compare the performances of four-time series forecasting models, including Multilayer Perceptron (MLP), Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), and CNN-LSTM, in the context of univariate and multivariate analysis for CPO prices in Malaysia. This research methodology is based on five phases: research understanding, data understanding, data preparation, modeling, and evaluation. Monthly CPO prices, the production and export volume of CPO, selected vegetable oil prices, crude oil prices, and monthly exchange rate data from January 2009 to December 2022 were utilized. The metrics evaluation of Root Mean Square Error (RMSE) and Mean Absolute Percentage Error (MAPE) were then performed to compare and evaluate the performance of the models. Experimental analysis indicates that the CNN model trained on a multivariate dataset with carefully selected significant independent variables outperformed other models. With a configuration of 500 epochs and early stopping, it achieved remarkable results compared to models trained using a univariate approach, boasting an RMSE of 245.611 and a MAPE of 7.13. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.
ISSN:3029743
DOI:10.1007/978-981-99-7339-2_5