Scaling behavior and practical relevance of inductive out-of-context reasoning (OOCR)
Establish the extent to which inductive out-of-context reasoning (OOCR) in large language models scales to learning more complex latent variables and determine its practical relevance for current large language models.
References
It is an open question how much inductive OOCR scales to learning more complex latents and how much it has practical relevance for current LLMs.
— Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data
(2406.14546 - Treutlein et al., 20 Jun 2024) in Introduction (Section 1)