Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Regret Bounds for Batched Bandits (1910.04959v2)

Published 11 Oct 2019 in cs.DS and cs.LG

Abstract: We present simple and efficient algorithms for the batched stochastic multi-armed bandit and batched stochastic linear bandit problems. We prove bounds for their expected regrets that improve over the best-known regret bounds for any number of batches. In particular, our algorithms in both settings achieve the optimal expected regrets by using only a logarithmic number of batches. We also study the batched adversarial multi-armed bandit problem for the first time and find the optimal regret, up to logarithmic factors, of any algorithm with predetermined batch sizes.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Hossein Esfandiari (39 papers)
  2. Amin Karbasi (116 papers)
  3. Abbas Mehrabian (31 papers)
  4. Vahab Mirrokni (153 papers)
Citations (58)

Summary

We haven't generated a summary for this paper yet.