Reasons for the o3 preview–release accuracy discrepancy on ARC-AGI-1
Identify the causes of the large discrepancy in ARC-AGI-1 accuracy between the o3-preview (pre-release) model and the subsequently released o3 model.
References
The reasons for this discrepancy are not known.
— Do AI Models Perform Human-like Abstract Reasoning Across Modalities?
(2510.02125 - Beger et al., 2 Oct 2025) in Section: Discussion (footnote)