
On-Demand Communication for Asynchronous Multi-Agent Bandits (2302.07446v2)

Published 15 Feb 2023 in cs.LG, cs.DC, cs.MA, and stat.ML

Abstract: This paper studies a cooperative multi-agent multi-armed stochastic bandit problem where agents operate asynchronously -- agent pull times and rates are unknown, irregular, and heterogeneous -- and face the same instance of a K-armed bandit problem. Agents can share reward information to speed up the learning process at additional communication costs. We propose ODC, an on-demand communication protocol that tailors the communication of each pair of agents based on their empirical pull times. ODC is efficient when the pull times of agents are highly heterogeneous, and its communication complexity depends on the empirical pull times of agents. ODC is a generic protocol that can be integrated into most cooperative bandit algorithms without degrading their performance. We then incorporate ODC into the natural extensions of UCB and AAE algorithms and propose two communication-efficient cooperative algorithms. Our analysis shows that both algorithms are near-optimal in regret.
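As a point of reference for the setting described in the abstract, the following is a minimal sketch of a fully shared UCB baseline, in which asynchronous agents pool every observation immediately. It is not the paper's ODC protocol, whose details are not given here. The standard UCB1 index, the Bernoulli rewards, and the names `ucb_index`, `run_shared_ucb`, and `pull_probs` (each agent's per-step probability of pulling, used to mimic heterogeneous pull rates) are all assumptions made for illustration.

```python
import math
import random


def ucb_index(mean, pulls, t):
    """Standard UCB1 index; an unpulled arm gets an infinite index."""
    if pulls == 0:
        return math.inf
    return mean + math.sqrt(2.0 * math.log(t) / pulls)


def run_shared_ucb(arm_means, pull_probs, horizon, seed=0):
    """Simulate agents that pull at different rates and share all statistics.

    Assumption: every observation is pooled at once, which is the
    communication-heavy baseline rather than an on-demand scheme.
    Returns the pooled pull count of each arm.
    """
    rng = random.Random(seed)
    k = len(arm_means)
    counts = [0] * k   # pooled number of pulls per arm
    sums = [0.0] * k   # pooled sum of rewards per arm
    for _ in range(horizon):
        for p in pull_probs:
            if rng.random() >= p:
                continue  # this agent is idle in this time step
            t = sum(counts) + 1
            indices = [
                ucb_index(sums[a] / counts[a] if counts[a] else 0.0, counts[a], t)
                for a in range(k)
            ]
            arm = max(range(k), key=indices.__getitem__)
            # Bernoulli reward (hypothetical reward model)
            reward = 1.0 if rng.random() < arm_means[arm] else 0.0
            counts[arm] += 1
            sums[arm] += reward
    return counts


if __name__ == "__main__":
    # One fast agent (pulls every step) and one slow agent (pulls 20% of steps).
    print(run_shared_ucb([0.1, 0.9], [1.0, 0.2], horizon=1000))
```

In this baseline the number of messages grows with the total number of pulls; the abstract's claim is that ODC instead ties the communication between each pair of agents to their empirical pull times.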

Authors (7)
  1. Yu-Zhen Janice Chen (8 papers)
  2. Lin Yang (212 papers)
  3. Xuchuang Wang (18 papers)
  4. Xutong Liu (28 papers)
  5. Mohammad Hajiesmaili (47 papers)
  6. John C. S. Lui (112 papers)
  7. Don Towsley (177 papers)
Citations (7)
