Tight Bounds for Online Convex Optimization with Adversarial Constraints (2405.09296v1)

Published 15 May 2024 in cs.LG and math.OC

Abstract: A well-studied generalization of standard online convex optimization (OCO) is constrained online convex optimization (COCO). In COCO, on every round, a convex cost function and a convex constraint function are revealed to the learner after the action for that round is chosen. The objective is to design an online policy that simultaneously achieves a small regret while ensuring small cumulative constraint violation (CCV) against an adaptive adversary. A long-standing open question in COCO is whether an online policy can simultaneously achieve $O(\sqrt{T})$ regret and $O(\sqrt{T})$ CCV without any restrictive assumptions. For the first time, we answer this in the affirmative and show that an online policy can simultaneously achieve $O(\sqrt{T})$ regret and $\tilde{O}(\sqrt{T})$ CCV. We establish this result by effectively combining the adaptive regret bound of the AdaGrad algorithm with Lyapunov optimization, a classic tool from control theory. Surprisingly, the analysis is short and elegant.
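To make the COCO setting concrete, the following is a minimal sketch of the general idea the abstract names: a projected online gradient step with an AdaGrad-style step size, applied to a surrogate cost that weights the constraint by a running violation queue. It is not the paper's algorithm or its constants. The one-dimensional decision set `[-1, 1]`, the quadratic Lyapunov function (which gives the weight `2 * Q`), the parameter `V`, and all function names are assumptions made for illustration.

```python
import math
import random


def project(x, lo=-1.0, hi=1.0):
    """Euclidean projection onto the assumed decision set [lo, hi]."""
    return max(lo, min(hi, x))


def run_coco(costs, constraints, V=1.0, diameter=2.0):
    """Play one action per round, then observe that round's cost and constraint.

    costs:       list of (f, grad_f) pairs, each a function of a float
    constraints: list of (g, grad_g) pairs; the constraint is g(x) <= 0
    Returns the list of actions played and the cumulative constraint violation.
    """
    x = 0.0            # initial action
    Q = 0.0            # running sum of violations (Lyapunov "queue")
    grad_sq_sum = 0.0  # AdaGrad accumulator of squared surrogate gradients
    actions = []
    ccv = 0.0
    for (f, grad_f), (g, grad_g) in zip(costs, constraints):
        actions.append(x)  # the action is committed before f and g are revealed
        violation = max(0.0, g(x))
        ccv += violation
        Q += violation
        # Subgradient of max(0, g) at x: zero wherever the constraint holds.
        sub_g = grad_g(x) if g(x) > 0 else 0.0
        # Surrogate gradient: scaled cost plus queue-weighted constraint term
        # (2 * Q is the derivative of the assumed Lyapunov function Q ** 2).
        grad = V * grad_f(x) + 2.0 * Q * sub_g
        grad_sq_sum += grad * grad
        if grad_sq_sum > 0.0:
            eta = diameter / math.sqrt(2.0 * grad_sq_sum)  # AdaGrad-style step
            x = project(x - eta * grad)
    return actions, ccv


# Hypothetical instance: quadratic costs (x - a_t)^2 and constraints x <= b_t.
random.seed(0)
T = 200
targets = [random.uniform(-1.0, 1.0) for _ in range(T)]
thresholds = [random.uniform(0.0, 1.0) for _ in range(T)]
costs = [(lambda x, a=a: (x - a) ** 2, lambda x, a=a: 2.0 * (x - a)) for a in targets]
constraints = [(lambda x, b=b: x - b, lambda x, b=b: 1.0) for b in thresholds]

actions, ccv = run_coco(costs, constraints)
total_cost = sum(f(x) for (f, _), x in zip(costs, actions))
print(f"total cost: {total_cost:.3f}, CCV: {ccv:.3f}")
```

Regret would be measured by comparing `total_cost` against the best fixed action that satisfies every constraint, here any point in `[-1, min(thresholds)]`. The sketch only illustrates the quantities involved and makes no claim about matching the stated bounds.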
