Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Online learning with graph-structured feedback against adaptive adversaries (1804.00335v1)

Published 1 Apr 2018 in cs.LG, cs.IT, math.IT, and stat.ML

Abstract: We derive upper and lower bounds for the policy regret of $T$-round online learning problems with graph-structured feedback, where the adversary is nonoblivious but assumed to have a bounded memory. We obtain upper bounds of $\widetilde O(T{2/3})$ and $\widetilde O(T{3/4})$ for strongly-observable and weakly-observable graphs, respectively, based on analyzing a variant of the Exp3 algorithm. When the adversary is allowed a bounded memory of size 1, we show that a matching lower bound of $\widetilde\Omega(T{2/3})$ is achieved in the case of full-information feedback. We also study the particular loss structure of an oblivious adversary with switching costs, and show that in such a setting, non-revealing strongly-observable feedback graphs achieve a lower bound of $\widetilde\Omega(T{2/3})$, as well.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Zhili Feng (22 papers)
  2. Po-Ling Loh (43 papers)
Citations (7)

Summary

We haven't generated a summary for this paper yet.