LLM-driven Imitation of Subrational Behavior : Illusion or Reality? (2402.08755v1)
Abstract: Modeling subrational agents, such as humans or economic households, is inherently challenging due to the difficulty in calibrating reinforcement learning models or collecting data that involves human subjects. Existing work highlights the ability of LLMs to address complex reasoning tasks and mimic human communication, while simulation using LLMs as agents shows emergent social behaviors, potentially improving our comprehension of human conduct. In this paper, we propose to investigate the use of LLMs to generate synthetic human demonstrations, which are then used to learn subrational agent policies though Imitation Learning. We make an assumption that LLMs can be used as implicit computational models of humans, and propose a framework to use synthetic demonstrations derived from LLMs to model subrational behaviors that are characteristic of humans (e.g., myopic behavior or preference for risk aversion). We experimentally evaluate the ability of our framework to model sub-rationality through four simple scenarios, including the well-researched ultimatum game and marshmallow experiment. To gain confidence in our framework, we are able to replicate well-established findings from prior human studies associated with the above scenarios. We conclude by discussing the potential benefits, challenges and limitations of our framework.
- Self-selection into laboratory experiments: pro-social motives versus monetary incentives. Experimental Economics, 18:195–214, 2015.
- Using large language models to simulate multiple humans and replicate human subject studies. In International Conference on Machine Learning, pp. 337–371. PMLR, 2023.
- Ainslie, G. Picoeconomics: The strategic interaction of successive motivational states within the person. Cambridge University Press, 1992.
- Animal spirits: How human psychology drives the economy, and why it matters for global capitalism. Princeton university press, 2010.
- The hyperbolic consumption model: Calibration, simulation, and empirical evaluation. Journal of Economic perspectives, 15(3):47–68, 2001.
- Out of one, many: Using language models to simulate human samples. Political Analysis, 31(3):337–351, 2023.
- Barberis, N. C. Thirty years of prospect theory in economics: A review and assessment. Journal of economic perspectives, 27(1):173–196, 2013.
- Myopic loss aversion and the equity premium puzzle. The quarterly journal of Economics, 110(1):73–92, 1995.
- Present-bias, quasi-hyperbolic discounting, and fixed costs. Games and economic behavior, 69(2):205–223, 2010.
- On the opportunities and risks of foundation models. arXiv preprint arXiv:2108.07258, 2021.
- Trades, quotes and prices: financial markets under the microscope. Cambridge University Press, 2018.
- Language models are few-shot learners. Advances in neural information processing systems, 33:1877–1901, 2020.
- Sparks of artificial general intelligence: Early experiments with gpt-4. arXiv preprint arXiv:2303.12712, 2023.
- On the utility of learning about humans for human-ai coordination. Advances in neural information processing systems, 32, 2019.
- Intertemporal choice. In Behavioural and experimental economics, pp. 168–177. Springer, 2010.
- Human irrationality: both bad and good for reward inference. arXiv preprint arXiv:2111.06956, 2021.
- Can large language models be an alternative to human evaluations? arXiv preprint arXiv:2305.01937, 2023.
- Learning to simulate realistic limit order book markets from data as a world agent. In Proceedings of the Third ACM International Conference on AI in Finance, pp. 428–436, 2022.
- K-SHAP: Policy clustering algorithm for anonymous multi-agent state-action pairs. In Proceedings of the 40th International Conference on Machine Learning, pp. 6343–6363. PMLR, 23–29 Jul 2023. URL https://proceedings.mlr.press/v202/coletta23a.html.
- A survey of demonstration learning. arXiv preprint arXiv:2303.11191, 2023.
- Patterns of academic procrastination. Journal of College Reading and Learning, 30(2):120–134, 2000.
- Desposato, S. Ethics and experiments: Problems and solutions for social scientists and policy professionals. Routledge, 2015.
- Analyzing the impact of tax credits on households in simulated economic systems with learning agents. arXiv preprint arXiv:2311.17252, 2023.
- Effects of robot motion on human-robot collaboration. In Proceedings of the Tenth Annual ACM/IEEE International Conference on Human-Robot Interaction, pp. 51–58, 2015.
- The economy needs agent-based modelling. Nature, 460(7256):685–686, 2009.
- Towards artificial general intelligence via a multimodal foundation model. Nature Communications, 13(1):3094, 2022.
- Time discounting and time preference: A critical review. Journal of economic literature, 40(2):351–401, 2002.
- Household energy use: Applying behavioural economics to understand consumer decision-making and behaviour. Renewable and Sustainable Energy Reviews, 41:1385–1394, 2015.
- S33{}^{3}start_FLOATSUPERSCRIPT 3 end_FLOATSUPERSCRIPT: Social-network simulation system with large language model-empowered agents. arXiv preprint arXiv:2307.14984, 2023.
- Rate of temporal discounting decreases with amount of reward. Memory & cognition, 25:715–723, 1997.
- How close is chatgpt to human experts? comparison corpus, evaluation, and detection. arXiv preprint arXiv:2301.07597, 2023.
- An experimental analysis of ultimatum bargaining. Journal of economic behavior & organization, 3(4):367–388, 1982.
- World models. arXiv preprint arXiv:1803.10122, 2018.
- Machine intuition: Uncovering human-like intuitive decision-making in gpt-3.5. arXiv preprint arXiv:2212.05206, 2022.
- Measuring mathematical problem solving with the math dataset. arXiv preprint arXiv:2103.03874, 2021.
- Deep q-learning from demonstrations. In Proceedings of the AAAI conference on artificial intelligence, volume 32, 2018.
- Generative adversarial imitation learning. Advances in neural information processing systems, 29, 2016.
- Large language models are reasoning teachers. arXiv preprint arXiv:2212.10071, 2022.
- Horton, J. J. Large language models as simulated economic agents: What can we learn from homo silicus? Technical report, National Bureau of Economic Research, 2023.
- Neuroeconomics: Chapter 2. Experimental Economics and Experimental Game Theory. Elsevier Inc. Chapters, 2013.
- Model-based imitation learning for urban driving. Advances in Neural Information Processing Systems, 35:20703–20716, 2022.
- Towards reasoning in large language models: A survey. arXiv preprint arXiv:2212.10403, 2022.
- Imitation learning: A survey of learning methods. ACM Computing Surveys (CSUR), 50(2):1–35, 2017.
- Prospect theory: An analysis of decision under risk. Econometrica, 47(2):263–292, 1979.
- Large language models are zero-shot reasoners. Advances in neural information processing systems, 35:22199–22213, 2022.
- Korinek, A. Language models and cognitive automation for economic research. Technical report, National Bureau of Economic Research, 2023.
- Krawczyk, D. C. Social cognition. In Reasoning: The neuroscience of how we think, pp. 283–311. Elsevier, 2018.
- Kuhn, T. S. The structure of scientific revolutions. University of Chicago press, 1964.
- Reward design with language models. In The Eleventh International Conference on Learning Representations, 2022.
- Laibson, D. I. Hyperbolic discounting and consumption. PhD thesis, Massachusetts Institute of Technology, 1994.
- Langer, E. J. The illusion of control. Journal of personality and social psychology, 32(2):311, 1975.
- LeBaron, B. Agent-based computational finance. Handbook of computational economics, 2:1187–1233, 2006.
- Biased or limited: Modeling sub-rational human investors in financial markets. arXiv preprint arXiv:2210.08569, 2022.
- Deep learning, reinforcement learning, and world models. Neural Networks, 152:267–275, 2022.
- Recent advances in natural language processing via large pre-trained language models: A survey. ACM Computing Surveys, 2021.
- Attention in delay of gratification. Journal of personality and social psychology, 16(2):329, 1970.
- Algorithms for inverse reinforcement learning. In Icml, volume 1, pp. 2, 2000.
- Doing it now or later. American economic review, 89(1):103–124, 1999.
- OpenAI. Gpt-4 technical report, 2023a.
- OpenAI. Introducing chatgpt, 2023b. URL https://openai.com/blog/chatgpt. Accessed on: 2023-05-10.
- Social simulacra: Creating populated prototypes for social computing systems. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology, pp. 1–18, 2022.
- Generative agents: Interactive simulacra of human behavior. arXiv preprint arXiv:2304.03442, 2023.
- Pollak, R. A. Consistent planning. The Review of Economic Studies, 35(2):201–208, 1968.
- Where do you think you’re going?: Inferring beliefs about dynamics from behavior. Advances in Neural Information Processing Systems, 31, 2018.
- Prompt programming for large language models: Beyond the few-shot paradigm. In Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems, pp. 1–7, 2021.
- Affective, cognitive, and behavioral differences between high and low procrastinators. Journal of counseling psychology, 33(4):387, 1986.
- The neural basis of economic decision-making in the ultimatum game. Science, 300(5626):1755–1758, 2003.
- Large pre-trained language models contain human-like biases of what is right and wrong to do. Nature Machine Intelligence, 4(3):258–268, 2022.
- Sejnowski, T. J. Large language models and the reverse turing test. Neural computation, 35(3):309–342, 2023.
- Numeric magnitude comparison effects in large language models. arXiv preprint arXiv:2305.10782, 2023.
- Neural mechanisms mediating optimism bias. Nature, 450(7166):102–105, 2007.
- Simon, H. A. A behavioral model of rational choice. The quarterly journal of economics, pp. 99–118, 1955.
- Simon, H. A. Models of bounded rationality: Empirically grounded economic reason, volume 3. MIT press, 1997.
- Academic procrastination: Frequency and cognitive-behavioral correlates. Journal of counseling psychology, 31(4):503, 1984.
- Reinforcement learning: An introduction. MIT press, 2018.
- Thaler, R. H. Anomalies: The ultimatum game. Journal of economic perspectives, 2(4):195–206, 1988.
- The effect of myopia and loss aversion on risk taking: An experimental test. The quarterly journal of economics, 112(2):647–661, 1997.
- Advances in prospect theory: Cumulative representation of uncertainty. Journal of Risk and uncertainty, 5:297–323, 1992.
- Emergent abilities of large language models. Transactions on Machine Learning Research, 2022a.
- Chain-of-thought prompting elicits reasoning in large language models. Advances in Neural Information Processing Systems, 35:24824–24837, 2022b.
- Epidemic modeling with generative agents. arXiv preprint arXiv:2307.04986, 2023.
- Can large language models transform computational social science? arXiv preprint arXiv:2305.03514, 2023.