Extent of human-like abstract reasoning achieved by AI on ARC tasks
Determine the extent to which state-of-the-art AI models that achieve high accuracy on ARC tasks (e.g., OpenAI’s o3 reasoning model) have achieved human-like abstract reasoning abilities, as opposed to relying on surface-level patterns or shortcuts.
References
Despite the high accuracy of o3 on ARC tasks, it is not clear to what extent AI systems have achieved human-like abstract reasoning abilities.
— Do AI Models Perform Human-like Abstract Reasoning Across Modalities?
(2510.02125 - Beger et al., 2 Oct 2025) in Section 1, Introduction