Applicability of Algorithmic Information Ratio (AIR) in competitive multi-agent reinforcement learning
Investigate whether the Algorithmic Information Ratio (AIR) framework can be applied to competitive multi-agent reinforcement learning settings, specifically two-player zero-sum and multi-player general-sum Markov games, and determine corresponding regret guarantees and algorithmic formulations under AIR in these environments.
References
However, to the best of our knowledge, investigations into AIR are restricted to the simpler bandit and RL settings, while their applicability in the competitive MARL environment remains unknown.
— Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning
(2404.19292 - Zhang et al., 30 Apr 2024) in Section 1: Introduction — Related works, Connections to AIR paragraph