Convex-Concave Zero-sum Markov Stackelberg Games (2401.12437v1)

Published 23 Jan 2024 in cs.GT

Abstract: Zero-sum Markov Stackelberg games can be used to model myriad problems, in domains ranging from economics to human-robot interaction. In this paper, we develop policy gradient methods that solve these games in continuous state and action settings using noisy gradient estimates computed from observed trajectories of play. When the games are convex-concave, we prove that our algorithms converge to a Stackelberg equilibrium in polynomial time. We also show that reach-avoid problems are naturally modeled as convex-concave zero-sum Markov Stackelberg games, and that Stackelberg equilibrium policies are more effective than their Nash counterparts in these problems.
