From Word Models to World Models: Translating from Natural Language to the Probabilistic Language of Thought (2306.12672v2)

Published 22 Jun 2023 in cs.CL, cs.AI, and cs.SC

Abstract: How does language inform our downstream thinking? In particular, how do humans make meaning from language--and how can we leverage a theory of linguistic meaning to build machines that think in more human-like ways? In this paper, we propose rational meaning construction, a computational framework for language-informed thinking that combines neural language models with probabilistic models for rational inference. We frame linguistic meaning as a context-sensitive mapping from natural language into a probabilistic language of thought (PLoT)--a general-purpose symbolic substrate for generative world modeling. Our architecture integrates two computational tools that have not previously come together: we model thinking with probabilistic programs, an expressive representation for commonsense reasoning; and we model meaning construction with large language models (LLMs), which support broad-coverage translation from natural language utterances to code expressions in a probabilistic programming language. We illustrate our framework through examples covering four core domains from cognitive science: probabilistic reasoning, logical and relational reasoning, visual and physical reasoning, and social reasoning. In each, we show that LLMs can generate context-sensitive translations that capture pragmatically-appropriate linguistic meanings, while Bayesian inference with the generated programs supports coherent and robust commonsense reasoning. We extend our framework to integrate cognitively-motivated symbolic modules (physics simulators, graphics engines, and planning algorithms) to provide a unified commonsense thinking interface from language. Finally, we explore how language can drive the construction of world models themselves. We hope this work will provide a roadmap towards cognitive models and AI systems that synthesize the insights of both modern and classical computational perspectives.
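
The pipeline the abstract describes--an LLM translating utterances into condition expressions over a generative world model, followed by Bayesian inference--can be illustrated with a minimal sketch. The sketch below is written in plain Python rather than a probabilistic programming language, and the toy world model, example utterances, and hand-written "translations" are illustrative assumptions standing in for what the paper's framework would obtain from an LLM; it is not the authors' implementation.

```python
# Minimal sketch of the rational-meaning-construction pipeline from the abstract.
# Assumptions: the world model, the utterances, and the lambda "translations"
# below are made up for illustration; in the paper, an LLM produces code
# expressions in a probabilistic programming language, not hand-written lambdas.

import random

def world_model():
    """Generative world model: two players, latent strength and laziness,
    and a game outcome derived from effective effort."""
    strength = {p: random.gauss(50, 10) for p in ("alice", "bob")}
    lazy = {p: random.random() < 0.3 for p in ("alice", "bob")}
    effort = {p: strength[p] * (0.5 if lazy[p] else 1.0) for p in ("alice", "bob")}
    winner = "alice" if effort["alice"] > effort["bob"] else "bob"
    return {"strength": strength, "lazy": lazy, "winner": winner}

# Stand-in for LLM meaning construction: each natural-language utterance is
# mapped to a condition (a predicate over sampled world states).
translations = {
    "Alice beat Bob.": lambda w: w["winner"] == "alice",
    "Bob wasn't even trying.": lambda w: w["lazy"]["bob"],
}

def infer(query, conditions, n_samples=20000):
    """Bayesian inference by rejection sampling: keep samples from the world
    model that satisfy every translated utterance, then evaluate the query
    under that posterior."""
    accepted = []
    for _ in range(n_samples):
        w = world_model()
        if all(cond(w) for cond in conditions):
            accepted.append(w)
    if not accepted:
        return float("nan")
    return sum(query(w) for w in accepted) / len(accepted)

if __name__ == "__main__":
    conditions = [translations[u] for u in ("Alice beat Bob.", "Bob wasn't even trying.")]
    # Query: given what we were told, is Alice actually stronger than Bob?
    p = infer(lambda w: w["strength"]["alice"] > w["strength"]["bob"], conditions)
    print(f"P(Alice stronger than Bob | utterances) ~= {p:.2f}")
```

The design point this sketch tries to convey is the separation of roles in the framework: meaning construction (here, the lookup table of lambdas; in the paper, broad-coverage LLM translation) produces symbolic conditions, while coherent reasoning comes from probabilistic inference over the generative world model rather than from the language model itself.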

Citations (86)
