Establish empirical regularities of AI agents’ economic behavior
Characterize how AI agents built on large language models behave in economic settings. This means determining whether their preferences and beliefs are stable, steerable, and calibrated, and rigorously measuring their performance in both strategic and non-strategic decision tasks, so as to reduce current uncertainty about their behavior.
References
Benchmarks for testing agent behavior \citep{fish2025econevals} can help bridge this gap but, at present, there is just a lot we don't know.
Source: An Economy of AI Agents (2509.01063, Hadfield et al., 1 Sep 2025), in Introduction and agent foundations, Subsection “A primer on AI agents”