Learning How to Strategically Disclose Information (2403.08741v1)
Abstract: Strategic information disclosure, in its simplest form, considers a game between an information provider (sender) who has access to some private information that an information receiver is interested in. While the receiver takes an action that affects the utilities of both players, the sender can design information (or modify beliefs) of the receiver through signal commitment, hence posing a Stackelberg game. However, obtaining a Stackelberg equilibrium for this game traditionally requires the sender to have access to the receiver's objective. In this work, we consider an online version of information design where a sender interacts with a receiver of an unknown type who is adversarially chosen at each round. Restricting attention to Gaussian prior and quadratic costs for the sender and the receiver, we show that $\mathcal{O}(\sqrt{T})$ regret is achievable with full information feedback, where $T$ is the total number of interactions between the sender and the receiver. Further, we propose a novel parametrization that allows the sender to achieve $\mathcal{O}(\sqrt{T})$ regret for a general convex utility function. We then consider the Bayesian Persuasion problem with an additional cost term in the objective function, which penalizes signaling policies that are more informative and obtain $\mathcal{O}(\log(T))$ regret. Finally, we establish a sublinear regret bound for the partial information feedback setting and provide simulations to support our theoretical results.
- E. Kamenica and M. Gentzkow. Bayesian persuasion. American Economic Review, 101(6):2590–2615, 2011.
- V. P. Crawford and J. Sobel. Strategic information transmission. Econometrica: Journal of the Econometric Society, pages 1431–1451, 1982.
- T. Başar and G. J. Olsder. Dynamic Noncooperative Game Theory. SIAM, 1998.
- M. Gentzkow and E. Kamenica. Competition in persuasion. The Review of Economic Studies, 84(1):300–322, 2016.
- Y. Wang. Bayesian persuasion with multiple receivers. Available at SSRN 2625399, 2013.
- E. Kamenica. Bayesian persuasion and information design. Annual Review of Economics, 11:249–272, 2019.
- M. Gentzkow and E. Kamenica. Costly persuasion. American Economic Review, 104(5):457–462, 2014.
- W. Tamura. Bayesian persuasion with quadratic preferences. Available at SSRN 1987877, 2018.
- Hierarchical multistage Gaussian signaling games in noncooperative communication and control systems. Automatica, 107:9–20, 2019.
- Value of information in games with multiple strategic information providers. arXiv preprint arXiv:2306.14886, 2023.
- Quadratic multi-dimensional signaling games and affine equilibria. IEEE Transactions on Automatic Control, 62(2):605–619, 2016.
- Online Bayesian persuasion. Advances in Neural Information Processing Systems, 33:16188–16198, 2020.
- Near-optimal algorithms for online matrix prediction. In Conference on Learning Theory, pages 38–1. JMLR Workshop and Conference Proceedings, 2012.
- Commitment without regrets: Online learning in Stackelberg security games. In Proceedings of the Sixteenth ACM Conference on Economics and Computation, pages 61–78, 2015.
- M. O. Sayin and T. Başar. Persuasion-based robust sensor design against attackers with unknown control objectives. IEEE Transactions on Automatic Control, 66(10):4589–4603, 2021.
- Information theory and statistics. Elements of Information Theory, 1(1):279–335, 1991.
- P. Christiano. Online local learning via semidefinite programming. In Proceedings of the Forty-sixth Annual ACM Symposium on Theory of Computing, pages 468–474, 2014.
- Online linear optimization with the log-determinant regularizer. IEICE Transactions on Information and Systems, 101(6):1511–1520, 2018.
- M. Zinkevich. Online convex programming and generalized infinitesimal gradient ascent. In Proceedings of the 20th International Conference on Machine Learning (ICML-03), pages 928–936, 2003.
- Logarithmic regret algorithms for online convex optimization. Machine Learning, 69(2):169–192, 2007.
- E. Hazan and S. Kale. Projection-free online learning. arXiv preprint arXiv:1206.4657, 2012.
- E. Hazan et al. Introduction to online convex optimization. Foundations and Trends® in Optimization, 2(3-4):157–325, 2016.
- A. Kalai and S. Vempala. Efficient algorithms for online decision problems. Journal of Computer and System Sciences, 71(3):291–307, 2005.
- C. Carathéodory. Über den variabilitätsbereich der fourier’schen konstanten von positiven harmonischen funktionen. Rendiconti Del Circolo Matematico di Palermo (1884-1940), 32(1):193–217, 1911.