Infinite Ends from Finite Samples: Open-Ended Goal Inference as Top-Down Bayesian Filtering of Bottom-Up Proposals (2407.16770v1)
Abstract: The space of human goals is tremendously vast; and yet, from just a few moments of watching a scene or reading a story, we seem to spontaneously infer a range of plausible motivations for the people and characters involved. What explains this remarkable capacity for intuiting other agents' goals, despite the infinitude of ends they might pursue? And how does this cohere with our understanding of other people as approximately rational agents? In this paper, we introduce a sequential Monte Carlo model of open-ended goal inference, which combines top-down Bayesian inverse planning with bottom-up sampling based on the statistics of co-occurring subgoals. By proposing goal hypotheses related to the subgoals achieved by an agent, our model rapidly generates plausible goals without exhaustive search, then filters out goals that would be irrational given the actions taken so far. We validate this model in a goal inference task called Block Words, where participants try to guess the word that someone is stacking out of lettered blocks. In comparison to both heuristic bottom-up guessing and exact Bayesian inference over hundreds of goals, our model better predicts the mean, variance, efficiency, and resource rationality of human goal inferences, achieving similar accuracy to the exact model at a fraction of the cognitive cost, while also explaining garden-path effects that arise from misleading bottom-up cues. Our experiments thus highlight the importance of uniting top-down and bottom-up models for explaining the speed, accuracy, and generality of human theory-of-mind.
- (2021). Modeling the mistakes of boundedly rational agents within a Bayesian theory of mind. In Proceedings of the Annual Meeting of the Cognitive Science Society (Vol. 43).
- (2009). Action understanding as inverse planning. Cognition, 113(3), 329–349.
- (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72(1-2), 81–138.
- Beale, A. (2016). 12dicts package. Retrieved from http://wordlist.aspell.net/12dicts-readme/
- (2021). Sequential core-set Monte Carlo. In Uncertainty in artificial intelligence (pp. 2165–2175).
- (2013). A computational-level explanation of the speed of goal inference. Journal of Mathematical Psychology, 57(3-4), 117–133.
- (1997). Fast planning through planning graph analysis. Artificial Intelligence, 90(1-2), 281–300.
- (2023). Inferring the future by imagining the past. In Thirty-seventh Conference on Neural Information Processing Systems.
- Chopin, N. (2002). A sequential particle filter method for static models. Biometrika, 89(3), 539–552.
- (2019). Gen: A general-purpose probabilistic programming system with programmable inference. In Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation (pp. 221–236).
- (2008). The pigeon as particle filter. Advances in Neural Information Processing Systems, 20, 369–376.
- (2006). Sequential Monte Carlo samplers. Journal of the Royal Statistical Society Series B: Statistical Methodology, 68(3), 411–436.
- (2012). On adaptive resampling strategies for sequential Monte Carlo methods. Bernoulli, 18(1), 252–278.
- (2016). Adapting the number of particles in sequential Monte Carlo methods through an online scheme for convergence assessment. IEEE Transactions on Signal Processing, 65(7), 1781–1794.
- (2003). Teleological reasoning in infancy: The naıve theory of rational action. Trends in cognitive sciences, 7(7), 287–292.
- (1994). Constructing inferences during narrative text comprehension. Psychological review, 101(3), 371.
- (2006). Optimal predictions in everyday cognition. Psychological science, 17(9), 767–773.
- (2010). Storing the web in memory: Space efficient language models with constant time retrieval. In Proceedings of the 2010 conference on Empirical Methods in Natural Language Processing (pp. 262–272).
- (2005). Automatic goal inferences. Journal of Experimental Social Psychology, 41(2), 129–140.
- (1944). An experimental study of apparent behavior. The American Journal of Psychology, 57(2), 243–259.
- (2007). Improving lrta*(k). In Proceedings of the 20th International Joint Conference on Artifical Intelligence (pp. 2312–2317).
- (2015). Children’s understanding of the costs and rewards underlying rational action. Cognition, 140, 14–23.
- (2019). The naive utility calculus as a unified, quantitative framework for action understanding. PsyArXiv.
- (2016). Coordinate to cooperate or compete: Abstract goals and joint intentions in social interaction. In Proceedings of the Annual Meeting of the Cognitive Science Society.
- (2006). Real-Time Adaptive A*. In Proceedings of the Fifth International Joint Conference on Autonomous Agents and Multiagent Systems (pp. 281–288).
- (2009). Comparing real-time and incremental heuristic search for real-time situated agents. Autonomous Agents and Multi-Agent Systems, 18, 313–341.
- Korf, R. E. (1990). Real-time heuristic search. Artificial Intelligence, 42(2-3), 189–211.
- (2013). Bridging the gap between theory and practice of approximate bayesian inference. Cognitive Systems Research, 24, 2–8.
- (2022). Recursive monte carlo and variational inference with auxiliary variables. In Uncertainty in artificial intelligence (pp. 1096–1106).
- (2023). Probabilistic programming with stochastic probabilities. Proceedings of the ACM on Programming Languages, 7(PLDI), 1708–1732.
- (2023). SMCP3: Sequential Monte Carlo with probabilistic program proposals. In International Conference on Artificial Intelligence and Statistics (pp. 7061–7088).
- (2020). Resource-rational analysis: Understanding human cognition as the optimal use of limited computational resources. Behavioral and Brain Sciences, 43, e1.
- (2017). Ten-month-old infants infer the value of goals from the costs of actions. Science, 358(6366), 1038–1041.
- (1998). PDDL - the Planning Domain Definition Language (Tech. Rep.). Yale Center for Computational Vision and Control.
- (2021). Generating options and choosing between them depend on distinct forms of value representation. Psychological science, 32(11), 1731–1746.
- (2016). Spontaneous goal inference (sgi). Social and Personality Psychology Compass, 10(1), 64–80.
- (2019). How we know what not to think. Trends in cognitive sciences, 23(12), 1026–1040.
- (2009). Plan recognition as planning. In Twenty-first international joint conference on artificial intelligence.
- (2010). Probabilistic plan recognition using off-the-shelf classical planners. In Proceedings of the AAAI conference on artificial intelligence (Vol. 24).
- Schulz, L. (2012). Finding new facts; thinking new thoughts. Advances in child development and behavior, 43, 269–294.
- Speer, R. (2016). wordfreq: a Python library for looking up the frequencies of words in many languages. Retrieved from https://github.com/rspeer/wordfreq/
- (2017). Online learning of symbolic concepts. Journal of Mathematical Psychology, 77, 10–20.
- (2009). Help or hinder: Bayesian models of social goal inference. Advances in Neural Information Processing Systems, 22.
- (2012). Spontaneous goal inferences are often inferred faster than spontaneous trait inferences. Journal of Experimental Social Psychology, 48(1), 13–18.
- (2016). Online goal recognition through mirroring: Humans and agents. In Annual conference on advances in cognitive systems 2016.
- (2009). Explaining human multiple object tracking as resource-constrained approximate inference in a dynamic probabilistic model. Advances in Neural Information Processing Systems, 22.
- (2022). Symbolic Knowledge Distillation: from general language models to commonsense models. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (pp. 4602–4625). Seattle, United States: Association for Computational Linguistics.
- (2023). Inferring the goals of communicating agents from actions and instructions. ICML 2023 Workshop on Theory of Mind in Communicating Agents.
- Zhi-Xuan, T. (2020). GenParticleFilters.jl. Retrieved from https://github.com/probcomp/GenParticleFilters.jl
- Zhi-Xuan, T. (2022). PDDL.jl: An extensible interpreter and compiler interface for fast and flexible AI planning. Unpublished master’s thesis, MIT.
- (2020). Online Bayesian goal inference for boundedly rational planning agents. Advances in Neural Information Processing Systems, 33.
- (2024). Pragmatic instruction following and goal assistance via cooperative language guided inverse plan search. In Proceedings of the 23rd International Conference on Autonomous Agents and Multi-Agent Systems.
- Tan Zhi-Xuan (22 papers)
- Gloria Kang (1 paper)
- Vikash Mansinghka (31 papers)
- Joshua B. Tenenbaum (257 papers)