Bayesian Enhanced Stacking Method Integrating Multi-Model Advantages and its Application in Olympic Medal Prediction

Authors

  • Lu Kun
  • Yichen Du
  • Gaojun Jin
  • Wenbo Kuang
  • Jiaxi Li

DOI:

https://doi.org/10.54691/wdr4xc91

Keywords:

Olympic medal prediction, bayesian ensemble learning, stacking model, time series analysis.

Abstract

The Olympic medal tally serves as a key indicator for measuring a nation's comprehensive athletic strength, and accurate prediction holds significant importance for sports resource allocation and strategic planning. This study utilizes historical data from 163 countries/regions across Summer Olympics from 1896 to 2024. First, K-means clustering is employed to categorize participating nations into high, medium, and low medal-winning groups based on their performance levels, addressing data heterogeneity. Subsequently, we propose the Bayesian Enhanced Stacking Model (BESM). This model integrates three base learners-LightGBM, Attention-LSTM, and Bayesian Negative Binomial Regression-to capture nonlinear features, long-term temporal dependencies, and count data distribution characteristics, respectively. At the meta-learning layer, Bayesian Linear Regression enables adaptive weight allocation and uncertainty quantification. Systematic comparisons against mainstream models (SARIMA, Prophet, Random Forest, XGBoost, CNN) evaluated performance using MAE, RMSE, NB_Deviance, and SMAPE. Results demonstrate BESM's superiority across all groups: achieving the best performance on at least one metric in 88.5% of cases within the low-tier group (statistically significant, p<0.0001); demonstrated strong overall adaptability in the medium-medal group; and achieved the best SMAPE (33.3%) in the high-medal group. Traditional models exhibited pronounced group dependency and unstable performance. This study validates BESM's effectiveness in handling heterogeneous, sparse sports data, providing an accurate and reliable solution for Olympic medal prediction and national sports strategy formulation.

Downloads

Download data is not yet available.

References

[1] Bernard A B, Busse M R. Who wins the Olympic Games: Economic resources and medal totals[J]. Review of economics and statistics, 2004, 86(1): 413-417.

[2] **ang B, Zhao B. Olympic medal analysis and prediction[C]//Proceedings of the 2025 5th International Conference on Applied Mathematics, Modelling and Intelligent Computing. 2025: 765-772.

[3] Guo Y F. FORECASTING OLYMPIC MEDAL COUNTS: A MULTIPLE LINEAR REGRESSION MODEL[J]. 2025.

[4] Zulfikar R, STp M M. Estimation model and selection method of panel data regression: An overview of common effect, fixed effect, and random effect model[J]. JEMA: Jurnal Ilmiah Bidang Akuntansi, 2018, 9(2): 1-10.

[5] Wang Z, Yang W. Olympic Medal Prediction Based on Multiple Regression and Time Series Analysis[J]. IJHSA, 2025, 46(4): 6491-6502.

[6] Schlembach C, Schmidt S L, Schreyer D, et al. Forecasting the Olympic medal distribution during a pandemic: a socio-economic machine learning model[J]. arxiv preprint arxiv:2012.04378, 2020.

[7] Wu X, He J, Yip T, et al. A two-stage random forest method for short-term load forecasting[C]//2016 IEEE Power and Energy Society General Meeting (PESGM). IEEE, 2016: 1-5.

[8] Zhang Z, Ma T, Yao Y, et al. Predicting Olympic Medal Performance for 2028: Machine Learning Models and the Impact of Host and Coaching Effects[J]. Applied Sciences, 2025, 15(14): 7793.

[9] Li J, Cheng J, Shi J, et al. Brief introduction of back propagation (BP) neural network algorithm and its improvement[M]//Advances in computer science and information engineering: volume 2. Berlin, Heidelberg: Springer Berlin Heidelberg, 2012: 553-558.

[10] Chen T, Guestrin C. Xgboost: A scalable tree boosting system[C]//Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining. 2016: 785-794.

[11] Wang H, Hu D. Comparison of SVM and LS-SVM for regression[C]//2005 International conference on neural networks and brain. IEEE, 2005, 1: 279-283.

[12] Zhang C, Chen M. Prediction of Undrained Shear Strength Utilizing a Hybrid Stacking Model Enhanced by Bayesian Optimization Algorithm[J]. Transportation Research Record, 2025, 2679(3): 457-471.

Downloads

Published

2026-03-31

Issue

Section

Articles

How to Cite

Kun, Lu, Yichen Du, Gaojun Jin, Wenbo Kuang, and Jiaxi Li. 2026. “Bayesian Enhanced Stacking Method Integrating Multi-Model Advantages and Its Application in Olympic Medal Prediction”. Scientific Journal of Intelligent Systems Research 8 (3): 101-10. https://doi.org/10.54691/wdr4xc91.