A weighted ensemble model combining ARIMA, LSTM, and GBM for robust time series prediction
https://doi.org/10.17586/2226-1494-2025-25-6-1150-1159
Abstract
Time series forecasting has been used in research and applications in a number of domains such as environmental forecasting, healthcare, finance, supply chain management, and energy consumption. Accurate prediction of future values is necessary for strategic planning operational efficiency and well-informed decision-making regarding time-dependent variables. A hybrid time series forecasting architecture is proposed that combines the strengths of machine learning and statistical models, in particular Gradient Boosting Machines (GBM), Auto-Regressive Integrated Moving Average (ARIMA), and Long Short-Term Memory (LSTM) networks. While LSTM networks and GBM are able to capture complex dependencies and nonlinear patterns, the ARIMA model captures the linear components within the time series. The hybrid model exploits ARIMA interpretability, LSTM temporal memory ability, and GBM ensemble learning efficiency by integrating these three models. Comprehensive experiments conducted on benchmark data sets have shown that the accuracy and reliability of predictions of the proposed hybridization significantly exceeds both individual models and traditional baseline models. The results show that for a variety of real-world applications, hybrid architectures can deliver reliable and accurate time series predictions.
About the Authors
A. VigneshIndia
Arumugam Vignesh, Scientific Researcher
600087; Chennai
sc 57219977662
N. Vijayalakshmi
India
Natarajan Vijayalakshmi, PhD, Associate Professor
600087; Chennai
sc 57212868093
References
1. Aghemo C., Blaso L., Pellegrino A. Building automation and control systems: A case study to evaluate the energy and environmental performances of a lighting control system in offices. Automation in Construction, 2014, vol. 43, pp. 10–22. doi: 10.1016/j.autcon.2014.02.015
2. Arumugam V., Natarajan V. Time series modeling and forecasting using Autoregressive Integrated Moving Average and Seasonal Autoregressive Integrated Moving Average models. Instrumentation Mesure Métrologie, 2023, vol. 22, no. 4, pp. 161–168. doi: 10.18280/i2m.220404
3. Beard E., Marsden J., Brown J., Tombor I., Stapleton J., Michie S., West R. Understanding and using time series analyses in addiction research. Addiction, 2019, vol. 114, no. 10, pp. 1866–1884. doi: 10.1111/add.14643
4. Bernal J.L., Soumerai S., Gasparrini A. A methodological framework for model selection in interrupted time series studies. Journal of Clinical Epidemiology, 2018, vol. 103, pp. 82–91. doi: 10.1016/j.jclinepi.2018.05.026
5. Chou J.-S., Tran D.-S. Forecasting energy consumption time series using machine learning techniques based on usage patterns of residential householders. Energy, 2018, vol.165, part B, pp. 709–726. doi: 10.1016/j.energy.2018.09.144
6. Chujai P., Kerdprasop N., Kerdprasop K. Time series analysis of household electric consumption with ARIMA and ARMA models. Proc. of the International MultiConference of Engineers and Computer Scientists (IMECS), 2013, vol. 1, pp. 1–6.
7. Corry E., Pauwels P., Hu S., Keane M., O’Donnell J. A performance assessment ontology for the environmental and energy management of buildings. Automation in Construction, 2015, vol. 57, pp. 249–259. doi: 10.1016/j.autcon.2015.05.002
8. Dey P., Chaulya S.K., Kumar S. Hybrid CNN-LSTM and IoT-based coal mine hazards monitoring and prediction system. Process Safety and Environmental Protection, 2021, vol. 152, pp. 249–263. doi: 10.1016/j.psep.2021.06.005
9. Hu Y., Peng L., Li X., Yao X., Lin H., Chi T. A novel evolution tree for analyzing the global energy consumption structure. Energy, 2018, vol. 147, pp. 1177–1187. doi: 10.1016/j.energy.2018.01.093
10. Li K., Huang W., Hu G., Li J. Ultra-short term power load forecasting based on CEEMDAN-SE and LSTM neural network. Energy and Buildings, 2023, vol. 279, pp. 112666. doi: 10.1016/j.enbuild.2022.112666
11. Lee Y.-S., Tong L.-I. Forecasting energy consumption using a grey model improved by incorporating genetic programming. Energy Conversion and Management, 2011, vol. 52, no. 1, pp.147–152. doi: 10.1016/j.enconman.2010.06.053
12. Liboschik T., Fokianos K., Fried R. tscount: An R package for analysis of count time series following generalized linear models. Journal of Statistical Software, 2017, vol. 82, no. 5, pp. 1–51. doi: 10.18637/jss.v082.i05
13. Zhao N., Zhang H., Yang X., Yan J., You F. Emerging information and communication technologies for smart energy systems and renewable transition. Advances in Applied Energy, 2023, vol. 9, pp. 100125. doi: 10.1016/j.adapen.2023.100125
14. Peleg R., R. Weiss, A. Hoogi A. Leveraging the triple exponential moving average for fast-adaptive moment estimation. arXiv, 2023, arXiv:2306.01423v1. doi: 10.48550/arXiv.2306.01423
15. Pérez-Lombard L., Ortiz J., Pout C. A review on buildings energy consumption information. Energy and Buildings, 2008. vol. 40, no. 3, pp. 394–398. doi: 10.1016/j.enbuild.2007.03.007
16. Pomorski P., Gorse D. Improving on the Markov-switching regression model by the use of an adaptive moving average. Springer Proceedings in Business and Economics, 2023, pp. 17–30. doi: 10.1007/978-3-031-23844-4_2
17. Mehtab S., Sen J. Stock price prediction using CNN and LSTM-based deep learning models. Proc. of the International Conference on Decision Aid Sciences and Application (DASA), 2020, pp. 447–453. doi: 10.1109/DASA51403.2020.9317207
18. Saleti S., Panchumarthi L.Y., Kallam Y.R., Parchuri L., Jitte S. Enhancing forecasting accuracy with a moving average-integrated hybrid ARIMA-LSTM model. SN Computer Science, 2024, vol. 5, no. 6, pp. 704. doi: 10.1007/s42979-024-03060-4
19. Xue S., Chen H., Zheng X. Detection and quantification of anomalies in communication networks based on LSTM-ARIMA combined model. International Journal of Machine Learning and Cybernetics, 2022, vol. 13, no. 10, pp. 3159–3172. doi: 10.1007/s13042-022-01586-8
20. Thangarajan B., Nagaraja M.S., Dhandra B.V. A novel hybrid model for time series forecasting using artificial neural network and autoregressive integrated moving average models. Trends in Mathematics, 2024, part F2357, pp. 747–754. doi: 10.1007/978-3-031-41420-6_62
21. He W., King M., Luo X., Dooner M., Li D., Wang J. Technologies and economics of electric energy storages in power systems : Reviewand perspective. Advances in Applied Energy, 2021, vol. 4, pp. 100060. doi: 10.1016/j.adapen.2021.100060
22. Xu W., Peng H., Zeng X., Zhou F., Tian X., Peng X. Deep belief network-based AR model for nonlinear time series forecasting. Applied Soft Computing, 2019, vol. 77, pp. 605–621. doi: 10.1016/j.asoc.2019.02.006
23. Yip S.-C., Wong K., Hew W.-P., Gan M.-T., Phan R.C.W., Tan S.-W. Detection of energy theft and defective smart meters in smart grids using linear regression. International Journal of Electrical Power & Energy Systems, 2017, vol. 91, pp. 230–240. doi: 10.1016/j.ijepes.2017.04.005
24. Young J.M., Stacey I., Dobbins T.A., Dunlop S., Dessaix A.L., Currow D.C. Association between tobacco plain packaging and Quitline calls: A population-based, interrupted time-series analysis. Medical Journal of Australia, 2014, vol. 200, no. 1, pp. 29–32. doi: 10.5694/mja13.11070
25. Zhou S., Wu Z., Li J., Zhang X.-P. Real-time energy control approach for smart home energy management system. Electric Power Components and Systems, 2014. vol. 42, no. 3–4, pp. 315–326. doi: 10.1080/15325008.2013.862322
26. Gidon J.S., Borah J., Sahoo S., Majumdar S. Neural network approaches for enhanced landslide prediction: A comparative study for Mawiongrim, Meghalaya, India. Natural Hazards, 2025, vol. 121, no. 3, pp. 3677–3699. doi: 10.1007/s11069-024-06948-9
27. Gidon J.S., Borah J., Sahoo S., Majumdar S., Fujita M. Bidirectional LSTM model for accurate and real-time landslide detection: A case study in Mawiongrim, Meghalaya, India. IEEE Internet of Things Journal, 2024, vol. 11, no. 3, pp. 3792–3800. doi: 10.1109/JIOT.2023.3326203
28. Arumugam V., Natarajan V. Enhanced time series forecasting using hybrid ARIMA and machine learning models. Indonesian Journal of Electrical Engineering and Computer Science, 2025, vol. 38, no. 3, pp. 1970–1979. doi: 10.11591/ijeecs.v38.i3.pp1970-1979
29. Nadzir M.S.M., Rabuan U., Ali S.H.M., Borah J., Majumdar S., Rohmad M.S. Drone-based air quality monitoring: Development and evaluation of low-cost PM2.5 sensor for remote environmental assessment. Sensors and Materials, 2025, vol. 37, no. 6, pp. 2153–2171. doi: 10.18494/sam5449
Review
For citations:
Vignesh A., Vijayalakshmi N. A weighted ensemble model combining ARIMA, LSTM, and GBM for robust time series prediction. Scientific and Technical Journal of Information Technologies, Mechanics and Optics. 2025;25(6):1150-1159. https://doi.org/10.17586/2226-1494-2025-25-6-1150-1159































