Mechanisms underlying performance degradation with increasing instruction count
Investigate and characterize the internal mechanisms in large language models (LLMs) that cause instruction-following performance to degrade as the number of simultaneous instructions increases. This includes analyzing attention patterns and internal model representations to identify the failure modes responsible for the degradation.
References
Although our study has systematically analyzed multiple-instructions-following ability of LLMs, several important questions remain for future work. Second, further investigation is needed into the mechanisms behind the performance degradation observed with increasing instruction count.
— When Instructions Multiply: Measuring and Estimating LLM Capabilities of Multiple Instructions Following
(2509.21051 - Harada et al., 25 Sep 2025) in Subsection Discussion, Section 5 Performance Prediction