Active Inference as a Model of Agency (2401.12917v1)
Abstract: Is there a canonical way to think of agency beyond reward maximisation? In this paper, we show that any type of behaviour complying with physically sound assumptions about how macroscopic biological agents interact with the world canonically integrates exploration and exploitation in the sense of minimising risk and ambiguity about states of the world. This description, known as active inference, refines the free energy principle, a popular descriptive framework for action and perception originating in neuroscience. Active inference provides a normative Bayesian framework to simulate and model agency that is widely used in behavioural neuroscience, reinforcement learning (RL) and robotics. The usefulness of active inference for RL is three-fold. \emph{a}) Active inference provides a principled solution to the exploration-exploitation dilemma that usefully simulates biological agency. \emph{b}) It provides an explainable recipe to simulate behaviour, whence behaviour follows as an explainable mixture of exploration and exploitation under a generative world model, and all differences in behaviour are explicit in differences in world model. \emph{c}) This framework is universal in the sense that it is theoretically possible to rewrite any RL algorithm conforming to the descriptive assumptions of active inference as an active inference algorithm. Thus, active inference can be used as a tool to uncover and compare the commitments and assumptions of more specific models of agency.
- A. Barto and R. Sutton. Reinforcement Learning: An Introduction. A Bradford Book, 1992.
- Jeremy Bentham. An Introduction to the Principles of Morals and Legislation. Dover Publications Inc., Mineola, N.Y, June 2007.
- J. Von Neumann and O. Morgenstern. Theory of Games and Economic Behavior. Princeton University Press, 1944.
- J. O. Berger. Statistical Decision Theory and Bayesian Analysis. Springer Series in Statistics. Springer-Verlag, New York, second edition, 1985.
- Richard E. Bellman. Dynamic Programming. Princeton University Press, Princeton, NJ, US, 1957.
- K. J Åström. Optimal control of Markov processes with incomplete state information. Journal of Mathematical Analysis and Applications, 10(1):174–205, February 1965.
- Jürgen Schmidhuber. A possibility for implementing curiosity and boredom in model-building neural controllers. In Proc. of the international conference on simulation of adaptive behavior: From animals to animats, pages 222–227, 1991.
- Curiosity-driven exploration by self-supervised prediction. In International conference on machine learning, pages 2778–2787. PMLR, 2017.
- Unifying count-based exploration and intrinsic motivation. Advances in neural information processing systems, 29, 2016.
- The free energy principle made simpler but not too simple. arXiv:2201.06387 [cond-mat, physics:nlin, physics:physics, q-bio], January 2022.
- K. Friston. The free-energy principle: A unified brain theory? Nature Reviews Neuroscience, 11(2):127–138, 2010.
- A free energy principle for the brain. J. Physiology-Paris, 100(1-3):70–87, 2006.
- The computational neurology of movement under active inference. Brain, March 2021.
- Hierarchical Active Inference: A Theory of Motivated Control. Trends in Cognitive Sciences, 22(4):294–306, April 2018.
- Computational mechanisms of curiosity and goal-directed exploration. eLife, page 45, 2019.
- A Bayesian computational model reveals a failure to adapt interoceptive precision estimates across depression, anxiety, eating, and substance use disorders. PLOS Computational Biology, 16(12):e1008484, December 2020.
- Canonical neural networks perform active inference. Communications Biology, 5(1):1–15, January 2022.
- Contrastive Active Inference. In Advances in Neural Information Processing Systems, May 2021.
- Deep active inference agents using Monte-Carlo methods. arXiv:2006.04176 [cs, q-bio, stat], June 2020.
- An empirical evaluation of active inference in multi-armed bandits. Neural Networks, 144:229–246, December 2021.
- Exploration and preference satisfaction trade-off in reward-free learning. arXiv preprint arXiv:2106.04316, 2021.
- Active Inference in Robotics and Artificial Agents: Survey and Challenges. arXiv:2112.01871 [cs], December 2021.
- How Active Inference Could Help Revolutionise Robotics. Entropy, 24(3):361, March 2022.
- Robot navigation as hierarchical active inference. Neural Networks, 142:192–204, October 2021.
- Cognitive and motor compliance in intentional human-robot interaction. In 2020 IEEE International Conference on Robotics and Automation (ICRA), pages 11291–11297, May 2020.
- D. V. Lindley. On a Measure of the Information Provided by an Experiment. The Annals of Mathematical Statistics, 27(4):986–1005, 1956.
- PID Control as a Process of Active Inference with Linear Generative Models. Entropy, 21(3):257, March 2019.
- Grigorios A. Pavliotis. Stochastic Processes and Applications: Diffusion Processes, the Fokker-Planck and Langevin Equations. Number volume 60 in Texts in Applied Mathematics. Springer, New York, 2014.
- Markov blankets, information geometry and stochastic thermodynamics. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 378(2164):20190159, February 2020.
- Luc Rey-Bellet. Open Classical Systems. In Stéphane Attal, Alain Joye, and Claude-Alain Pillet, editors, Open Quantum Systems II: The Markovian Approach, Lecture Notes in Mathematics, pages 41–78. Springer, Berlin, Heidelberg, 2006.
- The experience of agency: Feelings, judgments, and responsibility. Current Directions in Psychological Science, 18(4):242–246, 2009.
- Thomas Minka. Divergence measures and message passing. Technical report, 2005.
- Prospect Theory: An Analysis of Decision under Risk. Econometrica, 47(2):263–291, 1979.
- Sergey Levine. Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review. arXiv:1805.00909 [cs, stat], May 2018.
- On Stochastic Optimal Control and Reinforcement Learning by Approximate Inference. In Twenty-Third International Joint Conference on Artificial Intelligence, June 2013.
- Marc Toussaint. Robot trajectory optimization using approximate inference. In Proceedings of the 26th Annual International Conference on Machine Learning, ICML ’09, pages 1049–1056, Montreal, Quebec, Canada, June 2009. Association for Computing Machinery.
- On the Relationship Between Active Inference and Control as Inference. In Tim Verbelen, Pablo Lanillos, Christopher L. Buckley, and Cedric De Boom, editors, Active Inference, Communications in Computer and Information Science, pages 3–11, Cham, 2020. Springer International Publishing.
- R. E. Kalman. A New Approach to Linear Filtering and Prediction Problems. Journal of Basic Engineering, 82(1):35–45, March 1960.
- Emanuel Todorov. General duality between optimal control and estimation. In 2008 47th IEEE Conference on Decision and Control, pages 4286–4292, December 2008.
- Optimal control as a graphical model inference problem. Machine Learning, 87(2):159–182, May 2012.
- B. Ziebart. Modeling Purposeful Adaptive Behavior with the Principle of Maximum Causal Entropy. PhD thesis, Carnegie Mellon University, Pittsburgh, 2010.
- Abraham Kaplan. The Conduct of Inquiry. Transaction Publishers, 1973.
- John S. Bridle. Probabilistic Interpretation of Feedforward Classification Network Outputs, with Relationships to Statistical Pattern Recognition. In Françoise Fogelman Soulié and Jeanny Hérault, editors, Neurocomputing, NATO ASI Series, pages 227–236, Berlin, Heidelberg, 1990. Springer.
- R. Duncan Luce. Individual Choice Behavior. Individual Choice Behavior. John Wiley, Oxford, England, 1959.
- Jürgen Schmidhuber. Formal Theory of Creativity, Fun, and Intrinsic Motivation (1990–2010). IEEE Transactions on Autonomous Mental Development, 2(3):230–247, September 2010.
- David J. C. MacKay. Information-Based Objective Functions for Active Data Selection. Neural Computation, 4(4):590–604, July 1992.
- What is Intrinsic Motivation? A Typology of Computational Approaches. Frontiers in Neurorobotics, 1:6, November 2007.
- Novelty or Surprise? Frontiers in Psychology, 4, 2013.
- Planning to Be Surprised: Optimal Bayesian Exploration in Dynamic Environments. arXiv:1103.5708 [cs, stat], March 2011.
- Intrinsic Motivation and Self-Determination in Human Behavior. Perspectives in Social Psychology. Springer US, New York, 1985.
- Bayesian surprise attracts human attention. Vision research, 49(10):1295–1306, May 2009.
- Generative Models for Active Vision. Frontiers in Neurorobotics, 15, 2021.
- H. B. Barlow. Possible Principles Underlying the Transformations of Sensory Messages. The MIT Press, 1961.
- R Linsker. Perceptual Neural Organization: Some Approaches Based on Network Models and Information Theory. Annual Review of Neuroscience, 13(1):257–281, 1990.
- Temporal encoding of two-dimensional patterns by single units in primate inferior temporal cortex. III. Information theoretic analysis. Journal of Neurophysiology, 57(1):162–178, January 1987.
- The Exploration-Exploitation Dilemma: A Multidisciplinary Framework. PLOS ONE, 9(4):e95693, April 2014.
- Ready policy one: World building through active learning. In International Conference on Machine Learning, pages 591–601. PMLR, 2020.
- Varibad: A very good method for bayes-adaptive deep rl via meta-learning. arXiv preprint arXiv:1910.08348, 2019.
- Dream to control: Learning behaviors by latent imagination. arXiv preprint arXiv:1912.01603, 2019.
- Action and Perception as Divergence Minimization. arXiv:2009.01791 [cs, math, stat], October 2020.
- Exploration in approximate hyper-state space for meta reinforcement learning. In International Conference on Machine Learning, pages 12991–13001. PMLR, 2021.
- Synthesizing 3d shapes via modeling multi-view depth maps and silhouettes with deep generative networks. CVPR, 2017.
- Deep temporal models and active inference. Neuroscience & Biobehavioral Reviews, 90:486–501, July 2018.
- 3dp3: 3d scene perception via probabilistic programming. NeurIPS, 2021.
- Toward the neural implementation of structure learning. Current Opinion in Neurobiology, 37:99–105, April 2016.
- Building Machines That Learn and Think Like People. arXiv:1604.00289 [cs, stat], April 2016.
- Bayesian Models of Conceptual Development: Learning as Building Models of the World. Annual Review of Developmental Psychology, 2(1):533–558, 2020.
- Learning a theory of causality. Psychological Review, 118(1):110–119, 2011.
- Geometric Methods for Sampling, Optimisation, Inference and Adaptive Agents. In Geometry and Statistics, number 46 in Handbook of Statistics. Academic Press, 2022.
- Active inference on discrete state-spaces: A synthesis. J. Math. Psychology, 99:102447, 2020.
- Pymdp: A Python library for active inference in discrete state spaces. arXiv:2201.03904 [cs, q-bio], January 2022.
- A step-by-step tutorial on active inference and its application to empirical data. Journal of Mathematical Psychology, 107:102632, April 2022.
- Active Inference: The Free Energy Principle in Mind, Brain, and Behavior. MIT Press, Cambridge, MA, USA, March 2022.
- Active Inference, Belief Propagation, and the Bethe Approximation. Neural Computation, 30(9):2530–2567, September 2018.
- Branching Time Active Inference: Empirical study and complexity class analysis. arXiv:2111.11276 [cs], November 2021.
- Branching Time Active Inference: The theory and its generality. arXiv:2111.11107 [cs], November 2021.
- Active Tree Search in Large POMDPs. arXiv:2103.13860 [cs, math, q-bio], March 2021.
- Active Inference: A Process Theory. Neural Computation, 29(1):1–49, January 2017.
- Active Inference: Demystified and Compared. Neural Computation, 33(3):674–712, January 2021.
- Active Inference, Curiosity and Insight. Neural Computation, 29(10):2633–2683, October 2017.
- Thomas Parr. The Computational Neurology of Active Vision. PhD thesis, University College London, London, 2019.
- B. Millidge. Deep active inference as variational policy gradients. J. Math. Psychology, 96:102348, 2020.
- Path integrals, particular kinds and strange things. 2022.
- Active inference, Bayesian optimal design, and expected utility. arXiv:2110.04074 [cs, math, stat], September 2021.