Unknown architecture and mechanisms of OpenAI’s o1 Large Reasoning Model
Characterize the architecture, operational mechanisms, and internal capabilities of OpenAI’s o1 (Strawberry) Large Reasoning Model that differentiate it from standard autoregressive Large Language Models, including the structures employed during pretraining and inference.
References
we draw a distinction between previous LLMs and o1, a Large Reasoning Model (or LRM), as its new (unknown) architecture, operation, and capabilities all seem to be fundamentally different from those of vanilla LLMs, both at pretraining phase and at inference time.
— LLMs Still Can't Plan; Can LRMs? A Preliminary Evaluation of OpenAI's o1 on PlanBench
(2409.13373 - Valmeekam et al., 20 Sep 2024) in Introduction, footnote discussing Large Reasoning Models (following OpenAI), Section 1