Mechanism of in-context learning in Transformers
Characterize the mechanism by which Transformer-based Large Language Models perform in-context learning without parameter updates.
References
Despite the good performance of the ICL capabilities, the mechanism of ICL still remains an open question.
— Beyond the Black Box: Theory and Mechanism of Large Language Models
(2601.02907 - Gan et al., 6 Jan 2026) in Subsubsection In-Context Learning, Section 6: Inference Stage (Core Theories and Methods)