A Hybrid Transformer-MLP Approach for Short-Term Electric Load Forecasting

Authors

T. A. Nguyen, T. N. Tran

DOI:

https://doi.org/10.18196/jrc.v6i4.26960

Keywords:

Short-Term Load Forecasting, Hybrid Deep Learning Model, Transformer, MLP

Abstract

Short-term electric load forecasting plays a vital role in ensuring the stability and efficiency of smart grid operations. However, accurately predicting demand remains challenging because consumption patterns are nonlinear, volatile, and subject to long-term temporal dependencies. This study proposes a lightweight hybrid deep learning model that integrates a Transformer encoder with a multi-layer perceptron (MLP) to improve the accuracy and robustness of short-term load forecasts. The Transformer encoder extracts long-range temporal features through self-attention, while the MLP captures complex nonlinear mappings at the output stage. The model is evaluated on a real-world electricity load dataset collected from three Australian states (NSW, QLD, and VIC) between 2009 and 2014, using mean absolute percentage error (MAPE), mean squared error (MSE), and root mean squared error (RMSE) as performance metrics. Experimental results show that the proposed Transformer-MLP model achieves the lowest forecasting error in all three regions, with MAPE ranging from 0.69% to 0.95%, outperforming standard deep learning baselines including LSTM, CNN, and MLP. Despite its shallow architecture and reduced computational complexity, the hybrid model effectively captures both temporal dependencies and nonlinear variations. The study thus provides a practical, deployable forecasting solution for smart grids. Future work will extend the model to multi-step forecasting, incorporate exogenous variables such as weather and calendar effects, and explore deeper Transformer variants to further improve prediction accuracy and generalization across diverse load conditions.
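
For readers unfamiliar with the three error metrics named in the abstract, the following is a minimal sketch of their standard textbook definitions in plain Python. It is not the authors' evaluation code: the function names and the sample load values are hypothetical and chosen only for illustration, and the MAPE function assumes that no actual load value is zero.

```python
import math


def mse(actual, predicted):
    """Mean squared error: average of the squared forecast errors."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean squared error: square root of the MSE, in the units of the load."""
    return math.sqrt(mse(actual, predicted))


def mape(actual, predicted):
    """Mean absolute percentage error, in percent.

    Assumes every actual value is non-zero (true for aggregate grid load).
    """
    total = sum(abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100.0 * total / len(actual)


# Hypothetical load values in MW, for illustration only (not from the paper's dataset).
actual = [8000.0, 8200.0, 7900.0, 8100.0]
predicted = [7920.0, 8282.0, 7900.0, 8019.0]

print(f"MAPE = {mape(actual, predicted):.2f}%")   # prints "MAPE = 0.75%"
print(f"MSE  = {mse(actual, predicted):.2f}")     # prints "MSE  = 4921.25"
print(f"RMSE = {rmse(actual, predicted):.2f}")
```

MAPE is scale-free, which is why it is convenient for comparing regions with different demand levels, whereas MSE and RMSE depend on the magnitude of the load and penalize large errors more heavily.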

Published

2025-08-02

How to Cite

[1]
T. A. Nguyen and T. N. Tran, “A Hybrid Transformer-MLP Approach for Short-Term Electric Load Forecasting”, J Robot Control (JRC), vol. 6, no. 4, pp. 2033–2044, Aug. 2025.
