Cause of performance decline of o3-mini across later physics topics
Ascertain the causes of the observed decline in accuracy of OpenAI’s o3-mini on story problems as topics progress across Halliday and Resnick’s Fundamentals of Physics Vol. 1, particularly in waves and thermodynamics, and determine the factors driving this drop in performance.
References
The question of why the model performance of o3-mini was dropping as the topics progressed remains open.
                — AI Reasoning Models for Problem Solving in Physics
                
                (2508.20941 - Bralin et al., 28 Aug 2025) in Section 6: Limitations and Future Work