Necessity of explicit compositional incentives in RL
Determine whether large language models need an explicit compositional incentive in the reinforcement learning (RL) training objective to acquire compositional skills, or whether RL on atomic tasks alone, with no such incentive, is sufficient.
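To make the two training regimes concrete, here is a minimal sketch of the distinction between atomic and compositional tasks, in the $f(g(x))$ sense of the cited paper's title. Everything in it is an illustrative assumption rather than the paper's actual setup: the string functions `f` and `g`, the `compose` helper, and the exact-match `reward` are hypothetical stand-ins.

```python
from typing import Callable

StrFn = Callable[[str], str]

# Hypothetical atomic skills (assumed for illustration only).
def f(s: str) -> str:
    """Atomic skill f: reverse the string."""
    return s[::-1]

def g(s: str) -> str:
    """Atomic skill g: uppercase the string."""
    return s.upper()

def compose(outer: StrFn, inner: StrFn) -> StrFn:
    """Build the composed task x -> outer(inner(x))."""
    return lambda s: outer(inner(s))

def reward(prediction: str, task: StrFn, x: str) -> float:
    """Assumed exact-match reward: 1.0 if the prediction equals task(x), else 0.0."""
    return 1.0 if prediction == task(x) else 0.0

# Regime A: RL on atomic tasks only, so no reward ever depends on composing skills.
atomic_tasks = {"f": f, "g": g}

# Regime B: RL with an explicit compositional incentive, where the reward is
# given only for solving the composed task correctly.
compositional_tasks = {"f(g(x))": compose(f, g)}
```

Under these assumptions, the open question is whether a model trained only in Regime A would also score well on the tasks in Regime B, or whether rewards on composed tasks are required.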
References
Comparing the two works, we conjecture that an explicit incentive to composition is necessary.
— From $f(x)$ and $g(x)$ to $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones
(2509.25123 - Yuan et al., 29 Sep 2025) in Section 2, Background