Dice Question Streamline Icon: https://streamlinehq.com

Stable Performance Under High-Severity Disfluency

Develop methods that enable Speech Large Language Models (Speech-LLMs) to maintain stable performance under high-severity disfluency conditions.

Information Square Streamline Icon: https://streamlinehq.com

Background

Through fine-grained severity analyses, the paper shows that many models handle mild disfluency but degrade sharply at higher severity levels, particularly for phoneme-level repetition and long-context disruptions. This reveals robustness limitations that persist despite improvements in components such as ASR and LLM reasoning.

The authors explicitly identify achieving stability at high severity as an unresolved challenge, underscoring the need for targeted modeling, training objectives, and system design to ensure reliable performance under severe disfluency.

References

Overall, achieving stable performance under high-severity disfluency remains an open research problem.

VocalBench-DF: A Benchmark for Evaluating Speech LLM Robustness to Disfluency (2510.15406 - Liu et al., 17 Oct 2025) in Appendix, Further Experimental Results (following Tables on severity-level analyses)