Quantifying In-Context Reasoning Effects and Memorization Effects in LLMs (2405.11880v1)

Published 20 May 2024 in cs.LG, cs.AI, cs.CL, and cs.CV

Abstract: In this study, we propose an axiomatic system to define and quantify the memorization effects and in-context reasoning effects that an LLM uses for language generation. These effects are formulated as non-linear interactions between tokens/words encoded by the LLM. Specifically, the axiomatic system enables us to categorize memorization effects into foundational memorization effects and chaotic memorization effects, and to further classify in-context reasoning effects into enhanced inference patterns, eliminated inference patterns, and reversed inference patterns. Moreover, the decomposed effects satisfy the sparsity property and the universal matching property, which mathematically guarantee that the LLM's confidence score can be faithfully decomposed into memorization effects and in-context reasoning effects. Experiments show that this clean disentanglement of memorization and in-context reasoning effects enables a straightforward examination of the detailed inference patterns encoded by LLMs.
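The decomposition described in the abstract builds on interaction effects between input tokens. As a minimal illustrative sketch (not the paper's implementation), the code below computes Harsanyi-style interaction effects I(S) = Σ_{T⊆S} (−1)^{|S|−|T|} v(T) for every token subset S, where v(S) stands in for the LLM's confidence score on an input with only the tokens in S unmasked; the toy value function `v` and its `weights` are hypothetical. The final check demonstrates the universal matching property: the interaction effects sum back exactly to the score on the full input.

```python
from itertools import chain, combinations

def subsets(tokens):
    """Yield every subset (as a tuple) of an ordered token-index collection."""
    return chain.from_iterable(
        combinations(tokens, r) for r in range(len(tokens) + 1)
    )

def harsanyi_interactions(v, n):
    """Compute the interaction effect I(S) for every subset S of n tokens.

    v(S) -> float plays the role of the model's confidence score when
    only the tokens in S are present; I(S) = sum over T in S of
    (-1)^(|S|-|T|) * v(T).
    """
    effects = {}
    for S in subsets(tuple(range(n))):
        effects[S] = sum(
            (-1) ** (len(S) - len(T)) * v(T)
            for T in subsets(S)
        )
    return effects

# Hypothetical toy score: a few latent "inference patterns" fire when all
# of their tokens are present, each contributing a fixed weight.
weights = {(0,): 1.0, (1,): 0.5, (0, 1): 2.0, (1, 2): -0.7}

def v(S):
    present = set(S)
    return sum(w for pattern, w in weights.items() if set(pattern) <= present)

effects = harsanyi_interactions(v, 3)
full_input = (0, 1, 2)

# Universal matching: all interaction effects sum to the full-input score.
assert abs(sum(effects.values()) - v(full_input)) < 1e-9
# Sparsity: only the planted patterns carry non-negligible effects.
nonzero = {S for S, e in effects.items() if abs(e) > 1e-9}
assert nonzero == set(weights)
```

In this toy setting the recovered non-zero effects coincide exactly with the planted patterns, which is the sparsity property the abstract refers to; the paper's contribution is to further sort such effects into memorization versus in-context reasoning categories.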

