Relationship Between Task Difficulty and Generalization Performance in LLMs
Determine the relationship between task difficulty and generalization performance in large language models. Specifically, assess whether training on easier tasks improves performance on harder tasks, and whether training on harder tasks improves performance on easier tasks, across evaluation benchmarks.
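One way to make the two directions of this question concrete is to compare cross-difficulty training against a matched baseline. The sketch below is only an illustration under stated assumptions, not the method of the cited paper: it assumes examples have already been grouped into difficulty bins indexed from 0 (easiest) upward, and that an accuracy is available for every (training bin, evaluation bin) pair. The function name `generalization_gaps`, the gap definition (cross-difficulty accuracy minus the accuracy of a model trained on the evaluation bin itself), and the numbers in `toy_acc` are all hypothetical.

```python
from statistics import mean


def generalization_gaps(acc, n_bins):
    """Summarize cross-difficulty generalization from an accuracy table.

    acc maps (train_bin, eval_bin) -> accuracy, with bins indexed
    0 (easiest) .. n_bins - 1 (hardest). This layout is an assumption
    made for illustration, not a format taken from the cited paper.
    """
    if n_bins < 2:
        raise ValueError("need at least two difficulty bins to compare")

    easy_to_hard = []  # trained on an easier bin than the one evaluated
    hard_to_easy = []  # trained on a harder bin than the one evaluated
    for train_bin in range(n_bins):
        for eval_bin in range(n_bins):
            if train_bin == eval_bin:
                continue
            # Gap relative to training on the evaluation bin itself.
            # A negative gap means cross-difficulty training did worse
            # than matched-difficulty training.
            gap = acc[(train_bin, eval_bin)] - acc[(eval_bin, eval_bin)]
            if train_bin < eval_bin:
                easy_to_hard.append(gap)
            else:
                hard_to_easy.append(gap)

    return {
        "easy_to_hard_gap": mean(easy_to_hard),
        "hard_to_easy_gap": mean(hard_to_easy),
    }


# Made-up placeholder accuracies for two bins; NOT results from any paper.
toy_acc = {(0, 0): 0.9, (0, 1): 0.4, (1, 0): 0.7, (1, 1): 0.6}
toy_summary = generalization_gaps(toy_acc, n_bins=2)
```

Under this definition, a gap near zero in one direction would suggest that training at the other difficulty level transfers about as well as matched training, while a clearly negative gap would suggest it does not. Other baselines, such as the untrained model's accuracy on each bin, are equally reasonable choices and would answer a slightly different question.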
References
As shown in Table \ref{tab:difficulty_tension}, despite ongoing research in this area, the relationship between generalization performance and task difficulty remains an open question.
— Revisiting Generalization Across Difficulty Levels: It's Not So Easy
(2511.21692 - Kordi et al., 26 Nov 2025) in Section 1, Introduction