Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Reduction-based Framework for Sequential Decision Making with Delayed Feedback (2302.01477v5)

Published 3 Feb 2023 in cs.LG

Abstract: We study stochastic delayed feedback in general multi-agent sequential decision making, which includes bandits, single-agent Markov decision processes (MDPs), and Markov games (MGs). We propose a novel reduction-based framework, which turns any multi-batched algorithm for sequential decision making with instantaneous feedback into a sample-efficient algorithm that can handle stochastic delays in sequential decision making. By plugging different multi-batched algorithms into our framework, we provide several examples demonstrating that our framework not only matches or improves existing results for bandits, tabular MDPs, and tabular MGs, but also provides the first line of studies on delays in sequential decision making with function approximation. In summary, we provide a complete set of sharp results for multi-agent sequential decision making with delayed feedback.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Yunchang Yang (6 papers)
  2. Han Zhong (38 papers)
  3. Tianhao Wu (68 papers)
  4. Bin Liu (441 papers)
  5. Liwei Wang (239 papers)
  6. Simon S. Du (120 papers)
Citations (6)

Summary

We haven't generated a summary for this paper yet.