How do Language Models Bind Entities in Context? (2310.17191v2)

Published 26 Oct 2023 in cs.LG, cs.AI, and cs.CL

Abstract: To correctly use in-context information, language models (LMs) must bind entities to their attributes. For example, given a context describing a "green square" and a "blue circle", LMs must bind the shapes to their respective colors. We analyze LM representations and identify the binding ID mechanism: a general mechanism for solving the binding problem, which we observe in every sufficiently large model from the Pythia and LLaMA families. Using causal interventions, we show that LMs' internal activations represent binding information by attaching binding ID vectors to corresponding entities and attributes. We further show that binding ID vectors form a continuous subspace, in which distances between binding ID vectors reflect their discernibility. Overall, our results uncover interpretable strategies in LMs for representing symbolic knowledge in-context, providing a step towards understanding general in-context reasoning in large-scale LMs.
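The binding ID idea in the abstract can be illustrated with a toy model. The sketch below is not the paper's code and does not touch a real LM: it assumes, purely for illustration, that each representation is a content vector plus a binding ID vector, and that the content vector can be subtracted exactly. All names (`build_context`, `lookup`, `swap_binding_ids`) are hypothetical.

```python
import random

def rand_vec(dim, rng):
    # Random Gaussian vector standing in for a learned representation.
    return [rng.gauss(0.0, 1.0) for _ in range(dim)]

def add(u, v):
    return [a + b for a, b in zip(u, v)]

def sub(u, v):
    return [a - b for a, b in zip(u, v)]

def sq_dist(u, v):
    return sum((a - b) ** 2 for a, b in zip(u, v))

def build_context(pairs, dim=32, seed=0):
    # Toy assumption: representation = content vector + binding ID vector,
    # with one shared binding ID per (entity, attribute) pair.
    rng = random.Random(seed)
    content, entity_reps, attr_reps = {}, {}, {}
    for entity, attribute in pairs:
        content[entity] = rand_vec(dim, rng)
        content[attribute] = rand_vec(dim, rng)
        binding_id = rand_vec(dim, rng)
        entity_reps[entity] = add(content[entity], binding_id)
        attr_reps[attribute] = add(content[attribute], binding_id)
    return content, entity_reps, attr_reps

def lookup(content, entity_reps, attr_reps, entity):
    # Recover the entity's binding ID, then return the attribute whose
    # binding ID is closest to it.
    entity_id = sub(entity_reps[entity], content[entity])
    return min(
        attr_reps,
        key=lambda a: sq_dist(sub(attr_reps[a], content[a]), entity_id),
    )

def swap_binding_ids(content, entity_reps, e1, e2):
    # Toy "intervention": exchange the binding IDs of two entities.
    id1 = sub(entity_reps[e1], content[e1])
    id2 = sub(entity_reps[e2], content[e2])
    swapped = dict(entity_reps)
    swapped[e1] = add(content[e1], id2)
    swapped[e2] = add(content[e2], id1)
    return swapped

content, entity_reps, attr_reps = build_context(
    [("square", "green"), ("circle", "blue")]
)
print(lookup(content, entity_reps, attr_reps, "square"))  # green

swapped = swap_binding_ids(content, entity_reps, "square", "circle")
print(lookup(content, swapped, attr_reps, "square"))  # blue
```

In this toy setting, swapping the binding IDs of the two entities flips which color each shape is matched to, which is the kind of behavior a causal intervention on binding information would be expected to produce.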

Citations (26)

Tweets

https://twitter.com/jackm2003/status/1822621432020414585

https://twitter.com/jackm2003/status/1822677825125835169

https://twitter.com/PresItamar/status/1822675839357522428

https://twitter.com/1356751736350724097/status/1737182459152994337

https://twitter.com/MuzafferKal_/status/1761500511063400495
