Polynomial regret for learning bounded contracts with many actions
Ascertain whether online learning of bounded contracts against a fixed agent admits polynomial regret bounds when the number of actions is polynomial in the number of outcomes m; and characterize regret guarantees when the agent is sampled afresh in each round.
References
It remains an open question to prove (or disprove) that the problem admits a polynomial regret bound when the number of actions is polynomial in m. It is also open what the corresponding regret bounds are when the agent is sampled afresh in each round.
— Algorithmic Contract Theory: A Survey
(2412.16384 - Duetting et al., 20 Dec 2024) in Section 6.2 (Improved Regret Bounds with a Small Number of Actions)