Extent to which open-weight, small/medium LLMs benefit from self-evolving reasoning
Determine the extent to which open-weight large language models of small and medium scale can benefit from self-evolving reasoning paradigms to extend their reasoning limits on hard tasks, particularly in settings where verification and refinement capabilities are weak or unstable.
References
It is still unclear to what extent open-weight reasoning models, especially small and medium-sized ones with broader accessibility, can benefit from self-evolving paradigms and extend their reasoning limits.
                — Deep Self-Evolving Reasoning
                
                (2510.17498 - Liu et al., 20 Oct 2025) in Section 1 (Introduction)