Performance rating in chess, tennis, and other contexts
Abstract: In this note, I introduce Estimated Performance Rating (PR$e$), a novel system for evaluating player performance in sports and games. PR$e$ addresses a key limitation of the Tournament Performance Rating (TPR) system, which is undefined for zero or perfect scores in a series of games. PR$e$ is defined as the rating that solves an optimization problem related to scoring probability, making it applicable for any performance level. The main theorem establishes that the PR$e$ of a player is equivalent to the TPR whenever the latter is defined. I then apply this system to historically significant win-streaks in association football, tennis, and chess. Beyond sports, PR$e$ has broad applicability in domains where Elo ratings are used, from college rankings to the evaluation of LLMs.
- Paul CH Albers and Han de Vries “Elo-rating as a tool in the sequential estimation of dominance strengths” In Animal Behaviour Elsevier, 2001, pp. 489–495
- Nejat Anbarci, Ching-Jen Sun and M Utku Ünver “Designing practical and fair sequential team contests: The case of penalty shootouts” In Games and Economic Behavior 130 Elsevier, 2021, pp. 25–43
- “Psychological pressure in competitive environments: Evidence from a randomized natural experiment” In American Economic Review 100.5, 2010, pp. 2548–64
- “Fair elimination-type competitions” In European Journal of Operational Research 287.2, 2020, pp. 528–535 DOI: 10.1016/j.ejor.2020.03.025
- “A Revealed Preference Ranking of U.S. Colleges and Universities” In The Quarterly Journal of Economics 128.1, 2012, pp. 425–467 DOI: 10.1093/qje/qjs043
- “Gender, competition, and performance: Evidence from chess players” In Quantitative Economics 14.1 Wiley Online Library, 2023, pp. 349–380
- Steven J Brams and Mehmet S Ismail “Making the Rules of Sports Fairer” In SIAM Review 60.1 SIAM, 2018, pp. 181–202
- “Catch-Up: A Rule That Makes Service Sports More Competitive” In The American Mathematical Monthly 125.9 Taylor & Francis, 2018, pp. 771–796
- Danny Cohen-Zada, Alex Krumer and Offer Moshe Shapir “Testing the effect of serve order in tennis tiebreak” In Journal of Economic Behavior & Organization 146 Elsevier, 2018, pp. 106–115
- László Csató “UEFA Champions League entry has not satisfied strategyproofness in three seasons” In Journal of Sports Economics 20.7 Sage Publications Sage CA: Los Angeles, CA, 2019, pp. 975–981
- László Csató “Tournament Design: How Operations Research Can Improve Sports Rules” Springer Nature, 2021
- “Winning by Losing: Incentive Incompatibility in Multiple Qualifiers” In Journal of Sports Economics 19.8, 2018, pp. 1122–1146 DOI: 10.1177/1527002517704022
- Arpad E Elo “The Rating of Chess Players, Past and Present” New York: Arco Publishing, 1978
- FIDE “FIDE Handbook” Accessed: 01.12.2023, https://handbook.fide.com/chapter/B022022, 2022
- FIFA “Revision of the FIFA / Coca-Cola World Ranking” Accessed: 2023-12-17, https://digitalhub.fifa.com/m/f99da4f73212220/original/edbm045h0udbwkqew35a-pdf.pdf, 2018
- Mark E Glickman “The Glicko system”, 1995, pp. 9
- Dries R Goossens and Frits CR Spieksma “The carryover effect does not influence football results” In Journal of Sports Economics 13.3 Sage Publications Sage CA: Los Angeles, CA, 2012, pp. 288–305
- “Computer analysis of world chess champions” In ICGA Journal 29.2 IOS Press, 2006, pp. 65–73
- Lars Magnus Hvattum and Halvard Arntzen “Using ELO ratings for match result prediction in association football” Sports Forecasting In International Journal of Forecasting 26.3, 2010, pp. 460–470 DOI: 10.1016/j.ijforecast.2009.10.002
- Mehmet S Ismail “Human and Machine Intelligence in n𝑛nitalic_n-Person Games with Partial Knowledge” In arXiv preprint arXiv:2302.13937, 2023
- Graham Kendall and Liam J.A. Lenten “When sports rules go awry” In European Journal of Operational Research 257.2, 2017, pp. 377–394 DOI: 10.1016/j.ejor.2016.06.050
- Maya Kosoff “There’s a secret Tinder rating system and your score can only be seen by the company” Accessed: 2023-12-17, https://www.businessinsider.com/secret-tinder-rating-system-called-elo-score-can-only-be-seen-by-company-2016-1, 2016
- Steffen Künn, Christian Seel and Dainis Zegners “Cognitive Performance in Remote Work: Evidence from Professional Chess” In The Economic Journal 132.643, 2022, pp. 1218–1232 DOI: 10.1093/ej/ueab094
- Roel Lambers and Frits C R Spieksma “A mathematical analysis of fairness in shootouts” In IMA Journal of Management Mathematics 32.4, 2021, pp. 411–424 DOI: 10.1093/imaman/dpaa023
- Ignacio Palacios-Huerta “The Beautiful Dataset” In Available at SSRN 4665889, 2023
- Marc Pauly “Can strategizing in round-robin subtournaments be avoided?” In Social Choice and Welfare 43.1, 2014, pp. 29–46
- Radek Pelánek “Applications of the Elo rating system in adaptive educational systems” In Computers & Education 98, 2016, pp. 169–179 DOI: 10.1016/j.compedu.2016.03.017
- “Intrinsic Chess Ratings” In Proceedings of the AAAI Conference on Artificial Intelligence 25.1, 2011, pp. 834–839 DOI: 10.1609/aaai.v25i1.7951
- Philip Scarf, Muhammad Mat Yusof and Mark Bilbao “A numerical study of designs for sporting contests” In European Journal of Operational Research 198.1, 2009, pp. 190–198 DOI: 10.1016/j.ejor.2008.07.029
- In Journal of Quantitative Analysis in Sports 17.2, 2021, pp. 91–105 DOI: doi:10.1515/jqas-2019-0110
- “Judging LLM-as-a-judge with MT-Bench and Chatbot Arena”, 2023 arXiv:2306.05685
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.