Causes of Increased Target-Language Response Variance
Determine whether the higher response variance that large language models show when answering knowledge-intensive questions in target languages is a coping mechanism that keeps perplexity loss from escalating when pretraining data contains cross-language factual inconsistencies, and rigorously identify the underlying causes of this variance.
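To make "response variance" concrete, one possible proxy is the entropy of the answers a model gives when the same question is sampled several times in each language. The sketch below is illustrative only: the source does not say how variance was measured, and the function name `answer_entropy`, the normalization step, and the sample answers are all hypothetical.

```python
import math
from collections import Counter


def answer_entropy(answers):
    """Shannon entropy (in nats) of the empirical distribution of answers.

    Assumption: answers are compared after trimming whitespace and
    lowercasing; a real study would need a more careful notion of
    answer equivalence, especially across languages.
    """
    counts = Counter(a.strip().lower() for a in answers)
    total = sum(counts.values())
    # Sum p * log(1/p) over the distinct answers.
    return sum((c / total) * math.log(total / c) for c in counts.values())


# Hypothetical samples for one question, asked repeatedly in each language.
source_samples = ["Paris", "Paris", "Paris", "Paris"]
target_samples = ["Paris", "Lyon", "Paris", "Marseille"]

source_entropy = answer_entropy(source_samples)
target_entropy = answer_entropy(target_samples)
print(f"source entropy: {source_entropy:.3f} nats")
print(f"target entropy: {target_entropy:.3f} nats")
```

Under this proxy, a higher entropy in the target language than in the source language for the same question would be one symptom of the variance increase in question. It does not by itself say anything about the cause.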
References
We demonstrated that variance of responses increases in target but it is unclear what led to it (Appendix). Is increased variance in target a coping mechanism of LLMs to keep perplexity loss from exploding due to cross-language factual inconsistencies in pretraining data? We leave such analysis also for future work.
— Rethinking Cross-lingual Gaps from a Statistical Viewpoint
(2510.15551 - Piratla et al., 17 Oct 2025) in Discussion, Future Work and Limitations