Transformative schema creation for RL-trained LLMs
Establish whether reinforcement learning can enable large language models to achieve transformative generalization by creating new solution schemas—"schema creation"—for qualitatively novel cases, such as discovering invariants needed to solve perfectly periodic or degenerate dynamics in the BouncingSim benchmark.
References
Coding tasks appear more amenable to structural composition than symbolic math, yet transformative 'schema creation' remains an open challenge.
— DELTA-Code: How Does RL Unlock and Transfer New Programming Algorithms in LLMs?
(2509.21016 - Sun et al., 25 Sep 2025) in Section 5 (Generalization Study), Takeaways