Stochastic analysis of the Elo rating algorithm in round-robin tournaments (2212.12015v2)
Abstract: The Elo algorithm, renowned for its simplicity, is widely used for rating in sports tournaments and other applications. However, despite its widespread use, a detailed understanding of the convergence characteristics of the Elo algorithm is still lacking. Aiming to fill this gap, this paper presents a comprehensive (stochastic) analysis of the Elo algorithm, considering round-robin tournaments. Specifically, analytical expressions are derived describing the evolution of the skills and performance metrics. Then, taking into account the relationship between the behavior of the algorithm and the step-size value, which is a hyperparameter that can be controlled, design guidelines and discussions about the performance of the algorithm are provided. Experimental results are shown confirming the accuracy of the analysis and illustrating the applicability of the theoretical findings using real-world data obtained from SuperLega, the Italian volleyball league.
- R. Stefani, “The methodology of officially recognized international sports rating systems,” Journal of Quantitative Analysis in Sports, vol. 7, 2011.
- D. Barrow, I. Drayer, P. Elliott, G. Gaut, and B. Osting, “Ranking rankings: an empirical comparison of the predictive power of sports ranking methods,” Journal of Quantitative Analysis in Sports, vol. 9, pp. 187–202, 2013.
- J. Lasek and M. Gagolewski, “Interpretable sports team rating models based on the gradient descent algorithm,” International Journal of Forecasting, vol. 37, no. 3, pp. 1061–1071, 2021.
- L. M. Hvattum and H. Arntzen, “Using ELO ratings for match result prediction in association football,” International Journal of Forecasting, vol. 26, no. 3, pp. 460–470, 2010.
- S. Wolf, M. Schmitt, and B. Schuller, “A football player rating system,” Journal of Sports Analytics, vol. 6, no. 4, pp. 243–257, 2020.
- R. A. Bradley and M. E. Terry, “Rank analysis of incomplete block designs: I. The method of paired comparisons,” Biometrika, vol. 39, no. 3–4, pp. 324––345, 1952.
- M. E. Glickman, “Paired comparison models with time-varying parameters,” Ph.D. dissertation, Harvard University, 1993.
- C. Leitner, A. Zeileis, and K. Hornik, “Forecasting sports tournaments by ratings of (prob)abilities: A comparison for the EURO 2008,” International Journal of Forecasting, vol. 26, no. 3, pp. 471–4481, 2010.
- L. Vaughan Williams and H. O. Stekler, “Editorial: Sports forecasting,” International Journal of Forecasting, vol. 26, no. 3, pp. 445–447, 2010.
- I. McHale and T. Swartz, “Editorial: Forecasting in sports,” International Journal of Forecasting, vol. 35, no. 2, pp. 710–711, 2019.
- H. O. Stekler, D. Sendor, and R. Verlander, “Issues in sports forecasting,” International Journal of Forecasting, vol. 26, no. 3, pp. 606–621, Jul.–Sept. 2010.
- R. Ryall and A. Bedford, “An optimized ratings-based model for forecasting Australian Rules football,” International Journal of Forecasting, vol. 26, no. 3, pp. 511–517, 2010.
- M. E. Glickman, “A comprehensive guide to chess ratings,” American Chess Journal, no. 3, pp. 59–102, 1995.
- Fédération Internationale des Échecs, “FIDE Handbook: Rating Regulations effective from 1 January 2022,” https://archive.ph/T5Rb3, 2022, accessed: 2021-12-07.
- H. Van Eetvelde and C. Ley, “Ranking methods in soccer,” in Wiley StatsRef: Statistics Reference Online, R. S. Kenett, T. N. Longford, W. Piegorsch, and F. Ruggeri, Eds. John Wiley & Sons, Ltd, 2019, pp. 1–9.
- FIFA, “FIFA/Coca-Cola Women’s World Ranking,” https://digitalhub.fifa.com/m/3d9cb1decbbb2ac7/original/rxqyxdjhbs2qdtstluy6-pdf.pdf, accessed: 2021-11-12.
- FIFA, “FIFA: Men’s Ranking Procedure,” https://archive.ph/3Ch5V, 2018, accessed: 2021-12-07.
- Play! Pokémon, “Ratings and rankings (FAQ),” https://archive.ph/ZTecO, 2021, accessed: 2021-12-07.
- N. Silver, “Introducing NFL Elo Ratings,” https://fivethirtyeight.com/features/introducing-nfl-elo-ratings/, 2014, accessed: 2020-07-1.
- J. Carbone, T. Corke, and F. Moisiadis, “The rugby league prediction model: Using an Elo-based approach to predict the outcome of National Rugby League (NRL) matches,” International Educational Scientific Research Journal, vol. 2, pp. 26–30, May 2016.
- FiveThirtyEight, “NBA Elo ratings,” https://fivethirtyeight.com/tag/nba-elo-ratings/, 2022, accessed: 2022-06-01.
- M. E. Glickman, “Parameter estimation in large dynamic paired comparison experiments,” Journal of the Royal Statistical Society: Series C (Applied Statistics), vol. 48, no. 3, pp. 377–394, 1999.
- R. Herbrich and T. Graepel, “TrueSkill(TM): A Bayesian skill rating system,” Tech. Rep. MSR-TR-2006-80, January 2006. [Online]. Available: https://www.microsoft.com/en-us/research/publication/trueskilltm-a-bayesian-skill-rating-system-2/
- J. Lasek, Z. Szlávik, and S. Bhulai, “The predictive power of ranking systems in association football,” International Journal of Applied Pattern Recognition, vol. 1, no. 1, pp. 27–46, 2013.
- J. Dorsey, “Elo regression extending the Elo rating system,” Master’s thesis, University of Akron, Akron, OH, USA, 2019.
- S. Kovalchik, “Extension of the Elo rating system to margin of victory,” International Journal of Forecasting, vol. 36, no. 4, pp. 1329–1341, 2020.
- L. Szczecinski and A. Djebbi, “Understanding draws in Elo rating algorithm,” Journal of Quantitative Analysis in Sports, vol. 16, no. 3, pp. 211–220, 2020.
- E. Wheatcroft, “Forecasting football matches by predicting match statistics,” Journal of Sports Analytics, vol. 7, pp. 77–97, 2021.
- L. Szczecinski, “G-Elo: Generalization of the Elo algorithm by modeling the discretized margin of victory,” Journal of Quantitative Analysis in Sports, vol. 18, no. 1, pp. 1–14, 2022.
- L. Szczecinski and I.-I. Roatis, “FIFA ranking: Evaluation and path forward,” Journal of Sports Analytics, vol. 8, no. 4, pp. 231–250, dec 2022.
- eloratings.net, “World Football Elo Ratings,” https://www.eloratings.net/, 2020, accessed: 2020-09-08.
- M. Chater, L. Arrondel, J.-P. Gayant, and J.-F. Laslier, “Fixing match-fixing: Optimal schedules to promote competitiveness,” European Journal of Operational Research, vol. 294, no. 2, pp. 673–683, 2021.
- L. Csató, “Quantifying incentive (in)compatibility: A case study from sports,” European Journal of Operational Research, vol. 302, no. 2, pp. 717–726, 2022.
- ——, “How to avoid uncompetitive games? The importance of tie-breaking rules,” European Journal of Operational Research, vol. 307, no. 3, pp. 1260–1269, 2023.
- P.-E. Jabin and S. Junca, “A continuous model for ratings,” SIAM Journal on Applied Mathematics, vol. 75, no. 2, pp. 420–442, Mar. 2015.
- D. Aldous, “Elo ratings and the sports model: A neglected topic in applied probability?” Statistical Science, vol. 32, no. 4, pp. 616–629, Nov. 2017.
- O. Tobias and R. Seara, “Leaky delayed LMS algorithm: stochastic analysis for Gaussian data and delay modeling error,” IEEE Transactions on Signal Processing, vol. 52, no. 6, pp. 1596–1606, 2004.
- E. V. Kuhn, F. das Chagas de Souza, R. Seara, and D. R. Morgan, “On the stochastic modeling of the IAF-PNLMS algorithm for complex and real correlated Gaussian input data,” Signal Processing, vol. 99, pp. 103–115, 2014.
- M. V. Matsuo, E. V. Kuhn, and R. Seara, “Stochastic analysis of the NLMS algorithm for nonstationary environment and deficient length adaptive filter,” Signal Processing, vol. 160, pp. 190–201, Jul. 2019.
- ——, “On the diffusion NLMS algorithm applied to adaptive networks: Stochastic modeling and performance comparisons,” Digital Signal Processing, vol. 113, Identifier: 103018, Jun. 2021.
- K. J. Bakri, E. V. Kuhn, R. Seara, J. Benesty, C. Paleologu, and S. Ciochină, “On the stochastic modeling of the LMS algorithm operating with bilinear forms,” Digital Signal Processing, vol. 122, Identifier: 103359, Apr. 2022.
- K. J. Bakri, E. V. Kuhn, M. V. Matsuo, and R. Seara, “On the behavior of a combination of adaptive filters operating with the NLMS algorithm in a nonstationary environment,” Signal Processing, vol. 196, Identifier: 108465, Jul. 2022.
- S. Szymanski, “The economic design of sporting contests,” Journal of Economic Literature, vol. 41, no. 4, pp. 1137–1187, 2003. [Online]. Available: http://www.jstor.org/stable/3217458
- P. Scarf and M. Bilbao, “The optimal design of sporting contests,” Salford Business School Working Paper Series, pp. 1–17, Article ID: 320/06, Sept. 2006.
- P. Scarf, M. M. Yusof, and M. Bilbao, “A numerical study of designs for sporting contests,” European Journal of Operational Research, vol. 198, no. 1, pp. 190–198, 2009.
- J. Lasek and M. Gagolewski, “The efficacy of league formats in ranking teams,” Statistical Modelling, vol. 18, no. 5-6, pp. 411–435, 2018.
- J. González-Díaz and I. Palacios-Huerta, “Cognitive performance in competitive environments: Evidence from a natural experiment,” Journal of Public Economics, vol. 139, pp. 40–52, jul 2016.
- R. Darrell Bock, “Estimating item parameters and latent ability when responses are scored in two or more nominal categories,” Psychometrika, vol. 37, no. 1, pp. 29–51, 1972.
- R. Gramacy, S. Jensen, and M. Taddy, “Estimating player contribution in hockey with regularized logistic regression,” Journal of Quantitative Analysis in Sports, vol. 9, no. 1, pp. 97–111, 2013.
- FlashScore.ca, “Volleyball: Superlega Results Archive,” https://archive.ph/gP3Ea, 2021, accessed: 2021-11-15.