Algorithmic Collusion And The Minimum Price Markov Game
Abstract: This paper introduces the Minimum Price Markov Game (MPMG), a theoretical model that reasonably approximates real-world first-price markets following the minimum price rule, such as public auctions. The goal is to provide researchers and practitioners with a framework to study market fairness and regulation in both digitized and non-digitized public procurement processes, amid growing concerns about algorithmic collusion in online markets. Using multi-agent reinforcement learning-driven artificial agents, we demonstrate that (i) the MPMG is a reliable model for first-price market dynamics, (ii) the minimum price rule is generally resilient to non-engineered tacit coordination among rational actors, and (iii) when tacit coordination occurs, it relies heavily on self-reinforcing trends. These findings contribute to the ongoing debate about algorithmic pricing and its implications.
- J. A. Gerlick and S. M. Liozu, “Ethical and legal considerations of artificial intelligence and algorithmic decision-making in personalized pricing,” Journal of Revenue and Pricing Management 19 (4, 2020) 85–98. https://link.springer.com/article/10.1057/s41272-019-00225-2.
- L. L. Gormsen, “Algorithmic antitrust and consumer choice,” Economic Analysis of Law in European Legal Scholarship 12 (2022) 65–86.
- A. Ezrachi and M. Stucke, “Sustainable and unchallenged algorithmic tacit collusion,” Northwestern Journal of Technology and Intellectual Property 17 (2020) .
- Q. Li, N. Philipsen, C. Cauffman, and C. C. Nl, “Ai-enabled price discrimination as an abuse of dominance: a law and economics analysis,” China-EU Law Journal 2023 9:1 9 (4, 2023) 51–72. https://link.springer.com/article/10.1007/s12689-023-00099-z.
- E. Calvano, G. Calzolari, V. Denicolò, and S. Pastorello, “Artificial intelligence, algorithmic pricing, and collusion,” American Economic Review 110 (10, 2020) 3267–97.
- S. Assad, R. Clark, D. Ershov, and L. Xu, “Algorithmic pricing and competition: Empirical evidence from the german retail gasoline market,” SSRN Electronic Journal (11, 2020) . https://papers.ssrn.com/abstract=3682021.
- M. H. Rothkopf, “Daily repetition: A neglected factor in the analysis of electricity auctions,” The Electricity Journal 12 (1999) 60–70. https://sci-hub.st/https://www.sciencedirect.com/science/article/pii/S104061909900010X.
- Y. Ye, D. Qiu, M. Sun, D. Papadaskalopoulos, and G. Strbac, “Deep reinforcement learning for strategic bidding in electricity markets,” IEEE Transactions on Smart Grid (8, 2019) 1–1.
- A. C. Tellidou and A. G. Bakirtzis, “Agent-based analysis of capacity withholding and tacit collusion in electricity markets,” IEEE Transactions on Power Systems 22 (11, 2007) 1735–1742.
- J. Viehmann, S. Lorenczik, and R. Malischek, “Multi-unit multiple bid auctions in balancing markets: An agent-based q-learning approach,” Energy Economics 93 (2021) .
- I. Matsukawa, “Detecting collusion in retail electricity markets: Results from japan for 2005 to 2010,” Utilities Policy 57 (4, 2019) 16–23.
- B. Baranek, V. Titl, and L. Musolff, “Detection of collusive networks in e-procurement,” SSRN Electronic Journal (6, 2021) . https://papers.ssrn.com/abstract=3864186.
- J.-J. Laffont and D. Martimort, “Collusion under asymmetric information,” Econometrica 65 (1997) 875. https://www.researchgate.net/publication/4898655.
- B. Lebrun, “Uniqueness of the equilibrium in first-price auctions,” Games and Economic Behavior 55 (4, 2006) 131–151.
- D. Bergemann, B. Brooks, and S. Morris, “First-price auctions with general information structures: Implications for bidding and revenue,” Econometrica 85 (1, 2017) 107–143. https://onlinelibrary.wiley.com/doi/full/10.3982/ECTA13958https://onlinelibrary.wiley.com/doi/abs/10.3982/ECTA13958https://onlinelibrary.wiley.com/doi/10.3982/ECTA13958.
- Y. Chassin and M. Joanis, “Détecter et prévenir la collusion dans les marchés publics en construction: Meilleures pratiques favorisant la concurrence,” 2010. https://www.researchgate.net/publication/254398401%0ADétecter.
- J. E. H. Jr., “Detecting cartels,” Handbook of Antitrust Economics (2008) 213–258.
- R. P. Mcafee and J. Mcmillan, “Bidding rings,” 1992. https://www.jstor.org/stable/2117323.
- B. K. O. Tas, “Collusion detection in public procurement with limited information,” SSRN Electronic Journal (2017) .
- S. Chassang and J. Ortner, “Collusion in auctions with constrained bids: Theory and evidence from public procurement,” Journal of Political Economy 127 (2019) 2269–2300.
- E. Calvano, G. Calzolari, V. Denicolò, J. E. Harrington, and S. Pastorello, “Protecting consumers from collusive prices due to ai,” Science 370 (11, 2020) 1040–1042. https://www.science.org/doi/10.1126/science.abe3796.
- R. Axelrod, “The emergence of cooperation among egoists,” American Political Science Review 75 (6, 1981) 306–318. https://www.cambridge.org/core/journals/american-political-science-review/article/abs/emergence-of-cooperation-among-egoists/EEAB3C6460F5BC63A4DE813E1B010B21.
- I. Sadoune, M. Joanis, and A. Lodi, “Implementing a hierarchical deep learning approach for simulating multilevel auction data,” Computational Economics 2024 (5, 2024) 1–28. https://link.springer.com/article/10.1007/s10614-024-10622-4.
- C. Graf, V. Zobernig, J. Schmidt, and C. Klöckl, “Computational performance of deep reinforcement learning to find nash equilibria,” Computational Economics (2023) .
- M. Bichler, M. Fichtl, and M. Oberlechner, “Computing bayes nash equilibrium strategies in auction games via simultaneous online dual averaging,” EC 2023 - Proceedings of the 24th ACM Conference on Economics and Computation (8, 2022) 294. https://arxiv.org/abs/2208.02036v2.
- G. Brero, B. Lubin, and S. Seuken, “Probably approximately efficient combinatorial auctions via machine learning,” Proceedings of the AAAI Conference on Artificial Intelligence 31 (2, 2017) 397–405. https://ojs.aaai.org/index.php/AAAI/article/view/10624.
- M. Beyeler, G. Brero, B. Lubin, and S. Seuken, “imlca: Machine learning-powered iterative combinatorial auctions with interval bidding,”. https://dl.acm.org/doi/10.1145/3465456.3467535.
- P. Hummel and R. P. McAfee, “Machine learning in an auction environment,” WWW 2014 - Proceedings of the 23rd International Conference on World Wide Web 17 (2014) 7–17.
- K. T. Nguyen, “A bandit learning algorithm and applications to auction design,” Advances in Neural Information Processing Systems 33 (2020) 12070–12079.
- G. Gao, H. Huang, M. Xiao, J. Wu, Y. E. Sun, and S. Zhang, “Auction-based combinatorial multi-armed bandit mechanisms with strategic arms,” Proceedings - IEEE INFOCOM 2021-May (5, 2021) .
- S. Basu and A. Sankararaman, “Double auctions with two-sided bandit feedback,”. https://arxiv.org/abs/2208.06536v1.
- D. Goktas, S. Zhao, and A. Greenwald, “Zero-sum stochastic stackelberg games,” Advances in Neural Information Processing Systems 35 (12, 2022) 11658–11672.
- MIT Press, 1991.
- G. Brero, A. Eden, D. Chakrabarti, M. Gerstgrasser, A. Greenwald, V. Li, and D. C. Parkes, “Stackelberg pomdp: A reinforcement learning approach for economic design,”. https://arxiv.org/abs/2210.03852v3.
- G. Brero, A. Eden, M. Gerstgrasser, D. Parkes, and D. Rheingans-Yoo, “Reinforcement learning of sequential price mechanisms,” Proceedings of the AAAI Conference on Artificial Intelligence 35 (5, 2021) 5219–5227. https://ojs.aaai.org/index.php/AAAI/article/view/16659.
- G. Brero, D. Chakrabarti, A. Eden, M. Gerstgrasser, V. Li, and D. Parkes, “Learning stackelberg equilibria in sequential price mechanisms,” Proc. ICML Workshop for Reinforcement Learning Theory, 2021.
- A. Skrzypacz and H. Hopenhayn, “Tacit collusion in repeated auctions,” Journal of Economic Theory 114 (1, 2004) 153–169.
- L. Waltman and U. Kaymak, “Q-learning agents in a cournot oligopoly model,” Journal of Economic Dynamics and Control 32 (10, 2008) 3275–3293.
- Springer-Verlag, 12, 2005.
- J. K. Gupta, M. Egorov, and M. Kochenderfer, “Cooperative multi-agent control using deep reinforcement learning,” pp. 66–83. 2017.
- J. Foerster, R. Y. Chen, M. Al-Shedivat, S. Whiteson, P. Abbeel, and I. Mordatch, “Learning with opponent-learning awareness,” Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS 1 (2018) 122–130.
- M. Harper, V. Knight, M. Jones, G. Koutsovoulos, N. E. Glynatsi, and O. Campbell, “Reinforcement learning produces dominant strategies for the iterated prisoner’s dilemma,” PLOS ONE 12 (12, 2017) e0188046. https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0188046.
- V. Vassiliades, A. Cleanthous, and C. Christodoulou, “Multiagent reinforcement learning: Spiking and nonspiking agents in the iterated prisoner’s dilemma,” IEEE Transactions on Neural Networks 22 (4, 2011) 639–653.
- K. Li and D. Hao, “Cooperation enforcement and collusion resistance in repeated public goods games,” Proceedings of the AAAI Conference on Artificial Intelligence 33 (7, 2019) 2085–2092.
- R. Chaudhuri, K. Mukherjee, R. Narayanam, and R. D. Vallam, “Collaborative reinforcement learning framework to model evolution of cooperation in sequential social dilemmas,” Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 12712 LNAI (2021) 15–26. https://link.springer.com/chapter/10.1007/978-3-030-75762-5_2.
- M. R. Baye and D. Kovenock, “Bertrand competition,” The New Palgrave Dictionary of Economics (2008) 1–7. https://www.semanticscholar.org/paper/783221d378c9fe1ff3839b3ba5bf33c10e258705.
- Y. Zhou, S. Liu, Y. Qing, K. Chen, T. Zheng, Y. Huang, J. Song, and M. Song, “Is centralized training with decentralized execution framework centralized enough for marl?”. https://arxiv.org/abs/2305.17352v1.
- S. Sukhbaatar, A. Szlam, and R. Fergus, “Learning multiagent communication with backpropagation,” Advances in Neural Information Processing Systems (2016) 2252–2260.
- K. Zhang, Z. Yang, and T. Başar, “Multi-agent reinforcement learning: A selective overview of theories and algorithms,” Studies in Systems, Decision and Control 325 (2021) 321–384.
- P. Badjatiya, M. Sarkar, A. Sinha, S. Singh, N. Puri, and B. Krishnamurthy, “Inducing cooperation in multi-agent games through status-quo loss,” 1, 2020. http://arxiv.org/abs/2001.05458.
- H. V. Hasselt, A. Guez, and D. Silver, “Deep reinforcement learning with double q-learning,” pp. 2094–2100. 2016. www.aaai.org.
- C. Fischer and H. T. Normann, “Collusion and bargaining in asymmetric cournot duopoly—an experiment,” European Economic Review 111 (1, 2019) 360–379.
- J. Tirole, “The economics of tacit collusion,” Development (2003) .
- J. Miklós-Thal and C. Tucker, “Collusion by algorithm: Does better demand prediction facilitate coordination between sellers?” Management Science 65 (4, 2019) 1552–1561.
- A. Ezrachi and M. E. Stucke, “How pricing bots could form cartels and make things more expensive,” Havard Business Review (2016) . https://hbr.org/2016/10/how-pricing-bots-could-form-cartels-and-make-things-more-expensive?autocomplete=true.
- A. Ezrachi and S. M. E., “Virtual competition: The promise and perils of the algorithm-driven economy,” Havard Press University (2016) .
- E. J. Green, R. C. Marshall, and L. M. Marx, “Tacit collusion in oligopoly,” The Oxford Handbook of International Antitrust Economics 2 (2015) 1–25. www.oxfordhandbooks.com.
- J. E. Harrington, R. H. Gonzalez, and P. Kujal, “The relative efficacy of price announcements and express communication for collusion: Experimental findings,” Journal of Economic Behavior and Organization 128 (8, 2016) 251–264.
- M. A. Fonseca and H.-T. Normann, “Explicit vs. tacit collusion-the impact of communication in oligopoly experiments,” European Economic Review 56 (2012) 1759–1772.
- M. P. Wellman, K. Tuyls, and A. Greenwald, “Empirical game-theoretic analysis: A survey,”. https://arxiv.org/abs/2403.04018v1.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.