Do multilingual reasoning patterns extend beyond math and science?
Ascertain whether the multilingual reasoning patterns observed when Large Reasoning Models reason in English versus the question’s language on the MGSM and GPQA Diamond benchmarks (mathematical and scientific tasks) extend to open-ended questions and other task types.
References
Our evaluation of multilingual reasoning is confined to mathematical and scientific tasks; whether the patterns we observe extend to open-ended questions or other task types remains an open question.
— The Reasoning Lingua Franca: A Double-Edged Sword for Multilingual AI
(2510.20647 - Saji et al., 23 Oct 2025) in Section: Limitations