Extent and Modulation of LLM Behavioral Consistency with Human Decision-Making
Determine the extent to which large language models exhibit behavior consistent with human decision-making, and ascertain whether their behavior can be modulated through targeted interventions.
References
While LLMs now match or surpass human accuracy on standard reasoning benchmarks , their ability to reproduce these stochastic patterns remains an open question:
To what extent do LLMs exhibit behavior consistent with human decision-making, and can this behavior be modulated through targeted interventions?
                — Noise, Adaptation, and Strategy: Assessing LLM Fidelity in Decision-Making
                
                (2508.15926 - Feng et al., 21 Aug 2025) in Introduction (Section 1), immediately preceding and including the boxed question