Do LLM Agents Have Regret? A Case Study in Online Learning and Games (2403.16843v3)

Published 25 Mar 2024 in cs.LG, cs.AI, and cs.GT

Abstract: LLMs have been increasingly employed for (interactive) decision-making, via the development of LLM-based autonomous agents. Despite their emerging successes, the performance of LLM agents in decision-making has not been fully investigated through quantitative metrics, especially in the multi-agent setting when they interact with each other, a typical scenario in real-world LLM-agent applications. To better understand the limits of LLM agents in these interactive environments, we propose to study their interactions in benchmark decision-making settings in online learning and game theory, through the performance metric of regret. We first empirically study the no-regret behaviors of LLMs in canonical (non-stationary) online learning problems, as well as the emergence of equilibria when LLM agents interact through playing repeated games. We then provide some theoretical insights into the no-regret behaviors of LLM agents, under certain assumptions on the supervised pre-training and the rationality model of human decision-makers who generate the data. Notably, we also identify (simple) cases where advanced LLMs such as GPT-4 fail to be no-regret. To promote the no-regret behaviors, we propose a novel unsupervised training loss of regret-loss, which, in contrast to the supervised pre-training loss, does not require the labels of (optimal) actions. We then establish the statistical guarantee of generalization bound for regret-loss minimization, followed by the optimization guarantee that minimizing such a loss may automatically lead to known no-regret learning algorithms. Our further experiments demonstrate the effectiveness of our regret-loss, especially in addressing the above "regrettable" cases.
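The abstract uses regret as its performance metric without spelling it out. As a rough illustration only, the sketch below computes external regret (the learner's cumulative expected loss minus that of the best fixed action in hindsight) and pairs it with Hedge, a standard multiplicative-weights algorithm that is known to be no-regret. This is not the paper's code or its regret-loss: the function names `external_regret`, `hedge`, and `run_demo`, the random loss sequence, and the choice of step size are all assumptions made for this example.

```python
import math
import random


def external_regret(losses, plays):
    """Regret against the best fixed action in hindsight.

    losses[t][a] is the loss of action a at round t (assumed in [0, 1]);
    plays[t] is the probability distribution over actions used at round t.
    """
    num_actions = len(losses[0])
    # Cumulative expected loss actually incurred by the learner.
    incurred = sum(
        sum(p * l for p, l in zip(play, loss))
        for play, loss in zip(plays, losses)
    )
    # Cumulative loss of the single best action chosen with hindsight.
    best_fixed = min(sum(loss[a] for loss in losses) for a in range(num_actions))
    return incurred - best_fixed


def hedge(losses, eta):
    """Hedge (multiplicative weights), used here as a no-regret baseline."""
    num_actions = len(losses[0])
    cumulative = [0.0] * num_actions
    plays = []
    for loss in losses:
        # Subtract the minimum cumulative loss for numerical stability.
        base = min(cumulative)
        weights = [math.exp(-eta * (c - base)) for c in cumulative]
        total = sum(weights)
        plays.append([w / total for w in weights])
        # The loss vector is revealed only after the distribution is chosen.
        cumulative = [c + l for c, l in zip(cumulative, loss)]
    return plays


def run_demo(horizon=2000, num_actions=3, seed=0):
    """Run Hedge on hypothetical random losses and return (regret, horizon)."""
    rng = random.Random(seed)
    losses = [[rng.random() for _ in range(num_actions)] for _ in range(horizon)]
    eta = math.sqrt(8 * math.log(num_actions) / horizon)
    plays = hedge(losses, eta)
    return external_regret(losses, plays), horizon


if __name__ == "__main__":
    regret, horizon = run_demo()
    print(f"regret = {regret:.3f}, regret / T = {regret / horizon:.5f}")
```

An algorithm is called no-regret when regret grows sublinearly in the horizon T, so that regret / T goes to zero. With losses in [0, 1] and the step size used above, Hedge's regret is at most sqrt(T ln N / 2) for N actions, which is the kind of behavior an LLM agent's decisions would be measured against.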

Authors (4)

Chanwoo Park (24 papers)
Xiangyu Liu (47 papers)
Asuman Ozdaglar (102 papers)
Kaiqing Zhang (70 papers)

Citations (9)

Tweets

https://twitter.com/RELflintashery/status/1772665222777409796

https://twitter.com/chanwoopark20/status/1799295238005215703

https://twitter.com/EugeneVinitsky/status/1842954349938651291

https://twitter.com/gm8xx8/status/1772464586299503017

https://twitter.com/econ_cs/status/1772473837189234744

https://twitter.com/econ_cs/status/1795304456520618485
