On efficient computation in active inference
Abstract: Despite being recognized as neurobiologically plausible, active inference faces difficulties when employed to simulate intelligent behaviour in complex environments due to its computational cost and the difficulty of specifying an appropriate target distribution for the agent. This paper introduces two solutions that work in concert to address these limitations. First, we present a novel planning algorithm for finite temporal horizons with drastically lower computational complexity. Second, inspired by Z-learning from control theory literature, we simplify the process of setting an appropriate target distribution for new and existing active inference planning schemes. Our first approach leverages the dynamic programming algorithm, known for its computational efficiency, to minimize the cost function used in planning through the Bellman-optimality principle. Accordingly, our algorithm recursively assesses the expected free energy of actions in the reverse temporal order. This improves computational efficiency by orders of magnitude and allows precise model learning and planning, even under uncertain conditions. Our method simplifies the planning process and shows meaningful behaviour even when specifying only the agent's final goal state. The proposed solutions make defining a target distribution from a goal state straightforward compared to the more complicated task of defining a temporally informed target distribution. The effectiveness of these methods is tested and demonstrated through simulations in standard grid-world tasks. These advances create new opportunities for various applications.
- Reinforcement Learning: An Introduction. The MIT Press, second edition, 2018. URL http://incompleteideas.net/book/the-book-2nd.html.
- Emanuel Todorov. Linearly-solvable markov decision problems. In B. Schölkopf, J. Platt, and T. Hoffman, editors, Advances in Neural Information Processing Systems, volume 19. MIT Press, 2006. URL https://proceedings.neurips.cc/paper/2006/file/d806ca13ca3449af72a1ea5aedbed26a-Paper.pdf.
- Emanuel Todorov. Efficient computation of optimal actions. Proceedings of the National Academy of Sciences, 106(28):11478–11483, 2009. doi:10.1073/pnas.0710743106. URL https://www.pnas.org/doi/abs/10.1073/pnas.0710743106.
- Game theory. MIT press, 1991.
- Online distributed algorithms for seeking generalized nash equilibria in dynamic environments. IEEE Transactions on Automatic Control, 66(5):2289–2296, 2020.
- Dilip Mookherjee. Optimal incentive schemes with many agents. The Review of Economic Studies, 51(3):433–446, 1984. ISSN 00346527, 1467937X. URL http://www.jstor.org/stable/2297432.
- J. Von Neumann and O. Morgenstern. Theory of Games and Economic Behavior. Theory of Games and Economic Behavior. Princeton University Press, Princeton, NJ, US, 1944.
- Jeremy Bentham. An Introduction to the Principles of Morals and Legislation. History of Economic Thought Books, 1781.
- John Stuart Mill. Utilitarianism. Longmans, Green and Company, 1870. ISBN 978-1-4992-5302-3.
- P Ivan Pavlov (1927). Conditioned reflexes: An investigation of the physiological activity of the cerebral cortex. Annals of Neurosciences, 17(3):136–141, July 2010. ISSN 0972-7531. doi:10.5214/ans.0972-7531.1017309.
- Q-learning. Machine learning, 8:279–292, 1992.
- Efficient learning and planning within the dyna framework. Adaptive behavior, 1(4):437–454, 1993.
- Ben Goertzel. Artificial general intelligence: concept, state of the art, and future prospects. Journal of Artificial General Intelligence, 5(1):1, 2014.
- Samuel J Gershman. What have we learned about artificial intelligence from studying the brain? https://gershmanlab.com/pubs/NeuroAI_critique.pdf, 2023. Accessed: 2023-05-29.
- Richard Feynman. Statistical Mechanics: A Set Of Lectures. Westview Press, Boulder, Colo, 1st edition edition, March 1998. ISBN 978-0-201-36076-9.
- The Helmholtz Machine. Neural Computation, 7(5):889–904, September 1995. ISSN 0899-7667, 1530-888X. doi:10.1162/neco.1995.7.5.889.
- Karl Friston. The free-energy principle: A unified brain theory? Nature Reviews Neuroscience, 11(2):127–138, February 2010. ISSN 1471-003X, 1471-0048. doi:10.1038/nrn2787.
- The free energy principle made simpler but not too simple. arXiv:2201.06387 [cond-mat, physics:nlin, physics:physics, q-bio], January 2022.
- Reward Maximization Through Discrete Active Inference. Neural Computation, 35(5):807–852, April 2023. ISSN 0899-7667. doi:10.1162/neco_a_01574.
- Active inference on discrete state-spaces: A synthesis. Journal of Mathematical Psychology, 99:102447, 2020. ISSN 0022-2496. doi:10.1016/j.jmp.2020.102447. URL https://www.sciencedirect.com/science/article/pii/S0022249620300857.
- Active inference and behavior trees for reactive action planning and execution in robotics. IEEE Transactions on Robotics, 39(2):1050–1069, 2023. doi:10.1109/TRO.2022.3226144.
- An empirical study of active inference on a humanoid robot. IEEE Transactions on Cognitive and Developmental Systems, 14(2):462–471, jun 2022. doi:10.1109/tcds.2021.3049907. URL https://doi.org/10.1109.2021.3049907.
- Losing ourselves: Active inference, depersonalization, and meditation. Frontiers in Psychology, 11, 2020. ISSN 1664-1078. doi:10.3389/fpsyg.2020.539726. URL https://www.frontiersin.org/articles/10.3389/fpsyg.2020.539726.
- Sergio Rubin. Future climates: Markov blankets and active inference in the biosphere. Journal of The Royal Society Interface, 17:20200503, 11 2020. doi:10.1098/rsif.2020.0503.
- Deep active inference agents using Monte-Carlo methods. arXiv:2006.04176 [cs, q-bio, stat], June 2020.
- Goal-directed planning and goal understanding by extended active inference: Evaluation through simulated and physical robot experiments. Entropy, 24(4):469, 2022.
- Reinforcement learning or active inference? PLOS ONE, 4(7):1–13, 07 2009. doi:10.1371/journal.pone.0006421. URL https://doi.org/10.1371/journal.pone.0006421.
- Karl Friston. A free energy principle for biological systems. Entropy (Basel, Switzerland), 14:2100–2121, 11 2012. doi:10.3390/e14112100.
- Active inference: Demystified and compared. Neural Computation, 33(3):674–712, January 2021a. ISSN 0899-7667. doi:10.1162/neco_a_01357. URL https://doi.org/10.1162/neco_a_01357.
- The free energy principle for perception and action: A deep learning perspective. Entropy, 24(2):301, 2022.
- On the relationship between active inference and control as inference. In Tim Verbelen, Pablo Lanillos, Christopher L. Buckley, and Cedric De Boom, editors, Active Inference, pages 3–11, Cham, 2020. Springer International Publishing. ISBN 978-3-030-64919-7. doi:https://link.springer.com/chapter/10.1007/978-3-030-64919-7_1.
- Designing explainable artificial intelligence with active inference: A framework for transparent introspection and decision-making, 2023.
- Sophisticated inference. Neural Computation, 33(3):713–763, February 2021. ISSN 0899-7667. doi:10.1162/neco_a_01351. URL https://doi.org/10.1162/neco_a_01351.
- Planning and navigation as active inference. Biological Cybernetics, 112(4):323–343, 2018. ISSN 1432-0770. doi:10.1007/s00422-018-0753-2. URL https://doi.org/10.1007/s00422-018-0753-2.
- Morphogenesis as bayesian inference: A variational approach to pattern formation and control in complex biological systems. Physics of Life Reviews, 33:88–108, 2020. ISSN 1571-0645. doi:https://doi.org/10.1016/j.plrev.2019.06.001. URL https://www.sciencedirect.com/science/article/pii/S1571064519300909.
- Learning action-oriented models through active inference. PLOS Computational Biology, 16(4):1–30, 04 2020. doi:10.1371/journal.pcbi.1007805. URL https://doi.org/10.1371/journal.pcbi.1007805.
- The discrete and continuous brain: From decisions to movement-and back again. Neural computation, 30(29894658):2319–2347, September 2018. ISSN 0899-7667. doi:10.1162/neco_a_01102. URL https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6115199/.
- William S. Lovejoy. A survey of algorithmic methods for partially observed markov decision processes. Annals of Operations Research, 28(1):47–65, 1991. ISSN 1572-9338. doi:10.1007/BF02055574. URL https://doi.org/10.1007/BF02055574.
- A survey of point-based pomdp solvers. Autonomous Agents and Multi-Agent Systems, 27(1):1–51, 2013. ISSN 1573-7454. doi:10.1007/s10458-012-9200-2. URL https://doi.org/10.1007/s10458-012-9200-2.
- Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101(1):99–134, 1998. ISSN 0004-3702. doi:https://doi.org/10.1016/S0004-3702(98)00023-X. URL https://www.sciencedirect.com/science/article/pii/S000437029800023X.
- Free-energy minimization in joint agent-environment systems: A niche construction perspective. Journal of theoretical biology, 455:161–178, 2018.
- Exploration and preference satisfaction trade-off in reward-free learning. In ICML 2021 Workshop on Unsupervised Reinforcement Learning, 2021b.
- Karl Friston. A free energy principle for a particular physics, 2019.
- Variational Inference: A Review for Statisticians. Journal of the American Statistical Association, 112(518):859–877, April 2017. ISSN 0162-1459, 1537-274X. doi:10.1080/01621459.2017.1285773.
- Bayesian brains and the rényi divergence. Neural Computation, 34(4):829–855, 2022a.
- Information is power: intrinsic control via information capture. Advances in Neural Information Processing Systems, 34:1074745–10758, 2021.
- Smirl: Surprise minimizing rl in dynamic environments. arXiv preprint arXiv:1912.05510, 2019.
- A factor graph description of deep temporal active inference. Frontiers in Computational Neuroscience, 11:95, 2017. ISSN 1662-5188. doi:10.3389/fncom.2017.00095. URL https://www.frontiersin.org/article/10.3389/fncom.2017.00095.
- The graphical brain: Belief propagation and active inference. Network Neuroscience, 1(4):381–414, 2017. doi:10.1162/NETN_a_00018. URL https://doi.org/10.1162/NETN_a_00018.
- Active inference, bayesian optimal design, and expected utility. The Drive for Knowledge: The Science of Human Information Seeking, page 124, 2022b.
- Computational mechanisms of curiosity and goal-directed exploration. Elife, 8:e41703, 2019.
- Generalised free energy and active inference. Biological Cybernetics, 113(5):495–513, 2019. ISSN 1432-0770. doi:10.1007/s00422-019-00805-w. URL https://doi.org/10.1007/s00422-019-00805-w.
- Exploration in neo-hebbian reinforcement learning: Computational approaches to the exploration–exploitation balance with bio-inspired neural networks. Neural Networks, 151:16–33, 2022. ISSN 0893-6080. doi:https://doi.org/10.1016/j.neunet.2022.03.021. URL https://www.sciencedirect.com/science/article/pii/S0893608022000995.
- Branching Time Active Inference: The theory and its generality. arXiv:2111.11107 [cs], November 2021a.
- Branching Time Active Inference: Empirical study and complexity class analysis. arXiv:2111.11276 [cs], November 2021b.
- Learning perception and planning with deep active inference. In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 3952–3956, 2020. doi:10.1109/ICASSP40776.2020.9054364.
- Neural dynamics under active inference: Plausibility and efficiency of information processing. Entropy, 23(4), 2021. ISSN 1099-4300. doi:10.3390/e23040454. URL https://www.mdpi.com/1099-4300/23/4/454.
- Modelling non-reinforced preferences using selective attention. arXiv preprint arXiv:2207.13699, 2022c.
- Active inference for stochastic control. In Machine Learning and Principles and Practice of Knowledge Discovery in Databases, pages 669–680, Cham, 2021. Springer International Publishing. ISBN 978-3-030-93736-2. doi:https://doi.org/10.1007/978-3-030-93736-2_47.
- Collaborative data scheduling with joint forward and backward induction in small satellite networks. IEEE transactions on communications, 67(5):3443–3456, 2019.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.