Diverse Conventions for Human-AI Collaboration (2310.15414v1)

Published 24 Oct 2023 in cs.AI, cs.LG, and cs.MA

Abstract: Conventions are crucial for strong performance in cooperative multi-agent games, because they allow players to coordinate on a shared strategy without explicit communication. Unfortunately, standard multi-agent reinforcement learning techniques, such as self-play, converge to conventions that are arbitrary and non-diverse, leading to poor generalization when interacting with new partners. In this work, we present a technique for generating diverse conventions by (1) maximizing their rewards during self-play, while (2) minimizing their rewards when playing with previously discovered conventions (cross-play), stimulating conventions to be semantically different. To ensure that learned policies act in good faith despite the adversarial optimization of cross-play, we introduce \emph{mixed-play}, where an initial state is randomly generated by sampling self-play and cross-play transitions and the player learns to maximize the self-play reward from this initial state. We analyze the benefits of our technique on various multi-agent collaborative games, including Overcooked, and find that our technique can adapt to the conventions of humans, surpassing human-level performance when paired with real users.

Citations (2)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Related Papers

Generalized Beliefs for Cooperative AI (2022)
Collaborating with Humans without Human Data (2021)
"Other-Play" for Zero-Shot Coordination (2020)
Adversarially Guided Self-Play for Adopting Social Conventions (2020)
On-the-fly Strategy Adaptation for ad-hoc Agent Coordination (2022)