A Machine Learning Framework for Monthly Crude Oil Price Prediction with Cat Boost
DOI:
https://doi.org/10.63665/emkx0127Keywords:
Crude Oil Price Prediction, Machine Learning, CatBoost Regressor, Time Series Forecasting, Energy Economics, Predictive Analytics, Gradient Boosting, Historical Market Data, Feature Engineering, Rolling Statistics, Lag Features, Seasonal Analysis, RMSE, MAE, MAPE, Financial Forecasting, Oil Market Analysis, Data Science, Artificial Intelligence, Economic Prediction.Abstract
Crude oil is a globally significant energy resource whose price fluctuations have far-reaching economic and industrial impacts. Accurate forecasting of crude oil prices is crucial for strategic decision-making in sectors such as finance, energy, and transportation.
This project presents a machine learning-based approach to predict monthly crude oil prices using historical market data and engineered time-series features. The model is developed using the CatBoost Regressor, a high-performance gradient boosting algorithm known for its efficiency, accuracy, and ability to handle complex non-linear data.
The predictive features include: Lagged prices from previous months Rolling statistical indicators (mean and standard deviation) Temporal features such as month and year (encoded using sine and cosine transformations to preserve seasonality) Both percentage and absolute monthly price changes, The dataset spans over four decades (1983–2025), ensuring that the model captures long-term patterns and short-term fluctuations. Model performance is evaluated using RMSE, MAE, and MAPE metrics, demonstrating strong predictive accuracy and generalization capability.
Downloads
References
[1] G. Wu and Y.-J. Zhang, 'Does China factor matter? An econometric analysis of international crude oil prices,' Energy Policy, vol. 72, pp. 78–86, 2014.
[2] H. Mohammadi and L. Su, 'International evidence on crude oil price dynamics: Applications of ARIMA-GARCH models,' Energy Econ., vol. 32, no. 5, pp. 1001–1008, 2010.
[3] T. Yao and Y.-J. Zhang, 'Forecasting crude oil prices with the Google index,' Energy Proc., vol. 105, pp. 3772–3776, 2017.
[4] H. Chiroma et al., 'Evolutionary neural network model for west Texas intermediate crude oil price prediction,' Appl. Energy, vol. 142, pp. 266–273, 2015.
[5] Y. Zhao, J. Li, and L. Yu, 'A deep learning ensemble approach for crude oil price forecasting,' Energy Econ., vol. 66, pp. 9–16, 2017.
[6] M. Wang et al., 'A novel hybrid method of forecasting crude oil prices using complex network science and artificial intelligence algorithms,' Appl. Energy, vol. 220, pp. 480–495, 2018.
[7] H. He et al., 'A novel crude oil price trend prediction method: Machine learning classification algorithm based on multi-modal data features,' Energy, vol. 244, 2022.
[8] R. Li et al., 'A novel multiscale forecasting model for crude oil price time series,' Technol. Forecasting Social Change, vol. 173, 2021.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Authors

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
