On the Convergence of No-Regret Dynamics in Information Retrieval Games with Proportional Ranking Functions (2405.11517v3)
Abstract: Publishers who publish their content on the web act strategically, in a behavior that can be modeled within the online learning framework. Regret, a central concept in machine learning, serves as a canonical measure for assessing the performance of learning agents within this framework. We prove that any proportional content ranking function with a concave activation function induces games in which no-regret learning dynamics converge. Moreover, for proportional ranking functions, we prove the equivalence of the concavity of the activation function, the social concavity of the induced games and the concavity of the induced games. We also study the empirical trade-offs between publishers' and users' welfare, under different choices of the activation function, using a state-of-the-art no-regret dynamics algorithm. Furthermore, we demonstrate how the choice of the ranking function and changes in the ecosystem structure affect these welfare measures, as well as the dynamics' convergence rate.
- On the convergence of no-regret learning dynamics in time-varying games. Advances in Neural Information Processing Systems, 36, 2024.
- Heads-up limit hold’em poker is solved. Science, 347(6218):145–149, 2015.
- The probability ranking principle is not optimal in adversarial retrieval settings. In Proceedings of the 2015 International Conference on The Theory of Information Retrieval, pages 51–60, 2015.
- Regret minimization and the price of total anarchy. In Proceedings of the fortieth annual ACM symposium on Theory of computing, pages 373–382, 2008.
- From recommendation systems to facility location games. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 1772–1779, 2019.
- Convergence of learning dynamics in information retrieval games. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 1780–1787, 2019.
- Shapley facility location games. In Web and Internet Economics: 13th International Conference, WINE 2017, Bangalore, India, December 17–20, 2017, Proceedings 13, pages 58–73. Springer, 2017.
- A game-theoretic approach to recommendation systems with strategic content providers. Advances in Neural Information Processing Systems, 31, 2018.
- Superhuman ai for heads-up no-limit poker: Libratus beats top professionals. Science, 359(6374):418–424, 2018.
- A game theoretic analysis of the adversarial retrieval setting. Journal of Artificial Intelligence Research, 60:1127–1164, 2017.
- Near-optimal no-regret learning in general games. Advances in Neural Information Processing Systems, 34:27604–27616, 2021.
- On the convergence of regret minimization dynamics in concave games. In Proceedings of the Forty-First Annual ACM Symposium on Theory of Computing, STOC ’09, page 523–532. Association for Computing Machinery, 2009.
- Strong robustness to incomplete information and the uniqueness of a correlated equilibrium. Economic Theory, 73(1):91–119, 2022.
- Near-optimal no-regret learning dynamics for general convex games. In S. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, and A. Oh, editors, Advances in Neural Information Processing Systems, volume 35, pages 39076–39089. Curran Associates, Inc., 2022.
- Adaptive game playing using multiplicative weights. Games and Economic Behavior, 29(1-2):79–103, 1999.
- Calibrated learning and correlated equilibrium. Games and Economic Behavior, 21(1-2):40, 1997.
- Modeling content creator incentives on algorithm-curated platforms. arXiv preprint arXiv:2206.13102, 2022.
- A simple adaptive procedure leading to correlated equilibrium. Econometrica, 68(5):1127–1150, 2000.
- Supply-side equilibria in recommender systems. arXiv preprint arXiv.2206.13489, 2023.
- Let’s be honest: An optimal no-regret framework for zero-sum games. In International Conference on Machine Learning, pages 2488–2496. PMLR, 2018.
- Competitive search. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 2838–2849, 2022.
- The search for stability: Learning dynamics of strategic publishers with initial documents. arXiv preprint arXiv:2305.16695, 2024.
- Deepstack: Expert-level artificial intelligence in heads-up no-limit poker. Science, 356(6337):508–513, 2017.
- Digital content creation: An analysis of the impact of recommendation systems. Management Science, 2024.
- Stephen E Robertson. The probability ranking principle in ir. Journal of documentation, 1977.
- J Ben Rosen. Existence and uniqueness of equilibrium points for concave n-person games. Econometrica: Journal of the Econometric Society, pages 520–534, 1965.
- Tim Roughgarden. Intrinsic robustness of the price of anarchy. Journal of the ACM (JACM), 62(5):1–42, 2015.
- Information retrieval meets game theory: The ranking competition between documents’ authors. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 465–474, 2017.
- Optimization, learning, and games with predictable sequences. Advances in Neural Information Processing Systems, 26, 2013.
- Fast convergence of regularized learning in games. Advances in Neural Information Processing Systems, 28, 2015.
- Takashi Ui. Correlated equilibrium and concave games. International Journal of Game Theory, 37(1):1–13, 2008.
- Rethinking incentives in recommender systems: Are monotone rewards always beneficial? Advances in Neural Information Processing Systems, 36, 2024.
- YouTube. Youtube partner program overview & eligibility, 2023.
- Dense text retrieval based on pretrained language models: A survey, 2022.
- Learning to retrieve: How to train a dense retrieval model effectively and efficiently. arXiv preprint arXiv:2010.10469, 2020.