Adaptive Incentive Design with Learning Agents (2405.16716v2)
Abstract: How can the system operator learn an incentive mechanism that achieves social optimality based on limited information about the agents' behavior, who are dynamically updating their strategies? To answer this question, we propose an \emph{adaptive} incentive mechanism. This mechanism updates the incentives of agents based on the feedback of each agent's externality, evaluated as the difference between the player's marginal cost and society's marginal cost at each time step. The proposed mechanism updates the incentives on a slower timescale compared to the agents' learning dynamics, resulting in a two-timescale coupled dynamical system. Notably, this mechanism is agnostic to the specific learning dynamics used by agents to update their strategies. We show that any fixed point of this adaptive incentive mechanism corresponds to the optimal incentive mechanism, ensuring that the Nash equilibrium coincides with the socially optimal strategy. Additionally, we provide sufficient conditions that guarantee the convergence of the adaptive incentive mechanism to a fixed point. Our results apply to both atomic and non-atomic games. To demonstrate the effectiveness of our proposed mechanism, we verify the convergence conditions in two practically relevant games: atomic networked quadratic aggregative games and non-atomic network routing games.
- Aggregate comparative statics. Games and Economic Behavior, 81:27–49, 2013.
- Networks, shocks, and systemic risk. Technical report, National Bureau of Economic Research, 2015.
- Tamer Başar. Affine incentive schemes for stochastic systems with dynamic information. SIAM Journal on Control and Optimization, 22(2):199–210, 1984.
- Dynamic incentives for congestion control. IEEE Transactions on Automatic Control, 60(2):299–310, 2014.
- The Oxford handbook of the economics of networks. Oxford University Press, 2016.
- Learning optimal commitment to overcome insecurity. Advances in Neural Information Processing Systems, 27, 2014.
- Sample-efficient learning of stackelberg equilibria in general-sum games. Advances in Neural Information Processing Systems, 34:25799–25811, 2021.
- Public goods in networks. Journal of Economic theory, 135(1):478–494, 2007.
- Studies in the economics of transportation. Technical report, , 1956.
- Vivek S Borkar. Stochastic approximation with two time scales. Systems & Control Letters, 29(5):291–294, 1997.
- Vivek S Borkar. Stochastic approximation: a dynamical systems viewpoint, volume 48. Springer, 2009.
- Concentration bounds for two time scale stochastic approximation. In 2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton), pages 504–511. IEEE, 2018.
- Learning optimal tax design in nonatomic congestion games. arXiv preprint arXiv:2402.07437, 2024.
- Distributed dynamic pricing of multiscale transportation networks. IEEE Transactions on Automatic Control, 2021.
- High-order decentralized pricing dynamics for congestion games: Harnessing coordination to achieve acceleration. In 2023 American Control Conference (ACC), pages 1086–1091. IEEE, 2023.
- Stella Dafermos. Sensitivity analysis in variational inequalities. Mathematics of Operations Research, 13(3):421–434, 1988.
- The theory of learning in games, volume 2. MIT press, 1998.
- Finite-dimensional variational inequalities and complementarity problems. Springer Science & Business Media, 2007.
- Sergio Grammatico. Dynamic control of agents playing aggregative games with coupling constraints. IEEE Transactions on Automatic Control, 62(9):4537–4548, 2017.
- Christopher Harris. On the rate of convergence of continuous-time fictitious play. Games and Economic Behavior, 22(2):238–259, 1998.
- Morris W Hirsch. Systems of differential equations that are competitive or cooperative ii: Convergence almost everywhere. SIAM Journal on Mathematical Analysis, 16(3):423–439, 1985.
- A control-theoretic view on incentives. Automatica, 18(2):167–179, 1982.
- On the global convergence of stochastic fictitious play. Econometrica, 70(6):2265–2294, 2002.
- Online learning for traffic routing under unknown preferences. arXiv preprint arXiv:2203.17150, 2022.
- Learning and approximating the optimal strategy to commit to. In Algorithmic Game Theory: Second International Symposium, SAGT 2009, Paphos, Cyprus, October 18-20, 2009. Proceedings 2, pages 250–262. Springer, 2009.
- Dan Levin. Taxation within cournot oligopoly. Journal of Public Economics, 27(3):281–290, 1985.
- Inducing equilibria via incentives: Simultaneous design-and-play finds global optima. arXiv preprint arXiv:2110.01212, 2021.
- Higher order game dynamics. Journal of Economic Theory, 148(6):2666–2695, 2013.
- Socially optimal energy usage via adaptive pricing. arXiv preprint arXiv:2310.13254, 2023.
- Inducing desired equilibrium in taxi repositioning problem with adaptive incentive design. In 2023 62nd IEEE Conference on Decision and Control (CDC), pages 8075–8080. IEEE, 2023.
- The weighted majority algorithm. Information and computation, 108(2):212–261, 1994.
- Dynamic tolling for inducing socially optimal traffic loads. arXiv preprint arXiv:2110.08879, 2021.
- Stackelberg population learning dynamics. In 2022 IEEE 61st Conference on Decision and Control (CDC), pages 6395–6400. IEEE, 2022.
- Fictitious play property for games with identical interests. Journal of economic theory, 68(1):258–265, 1996.
- Follower agnostic methods for stackelberg games. arXiv preprint arXiv:2302.01421, 2023.
- Learning in games with continuous action sets and unknown payoff functions. Mathematical Programming, 173(1):465–507, 2019.
- John H Nachbar. “evolutionary” selection dynamics in games: Convergence and limit properties. International journal of game theory, 19(1):59–89, 1990.
- High-performance optimal incentive-seeking in transactive control for traffic congestion. European Journal of Control, 68:100696, 2022.
- A class of distributed adaptive pricing mechanisms for societal systems with limited information. In 2017 IEEE 56th Annual Conference on Decision and Control (CDC), pages 1490–1495. IEEE, 2017.
- Learning optimal strategies to commit to. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 2149–2156, 2019.
- Adaptive incentive design. IEEE Transactions on Automatic Control, 66(8):3871–3878, 2020.
- J Ben Rosen. Existence and uniqueness of equilibrium points for concave n-person games. Econometrica: Journal of the Econometric Society, pages 520–534, 1965.
- Tim Roughgarden. Algorithmic game theory. Communications of the ACM, 53(7):78–86, 2010.
- William H Sandholm. Population games and evolutionary dynamics. MIT press, 2010.
- Shankar Sastry. Nonlinear systems: analysis, stability, and control, volume 10. Springer Science & Business Media, 2013.
- AE Scheidegger. Stability of motion, by nn krasovskii. translated from the (1959) russian ed. by jl brenner. stanford university press, 1963. Canadian Mathematical Bulletin, 7(1):151–151, 1964.
- Dynamic interventions with limited knowledge in network games. arXiv preprint arXiv:2205.15673, 2022.
- Steering the aggregative behavior of noncooperative agents: a nudge framework. Automatica, 136:110003, 2022.
- David Roger Smart. Fixed point theorems, volume 66. Cup Archive, 1980.
- On best-response dynamics in potential games. SIAM Journal on Control and Optimization, 56(4):2734–2767, 2018.
- Mathematical and economic theory of road pricing. Emerald Group Publishing Limited, 2005.
- Steering no-regret learners to optimal equilibria. arXiv preprint arXiv:2306.05221, 2023.
- Chinmay Maheshwari (20 papers)
- Kshitij Kulkarni (16 papers)
- Manxi Wu (25 papers)
- Shankar Sastry (30 papers)