Dice Question Streamline Icon: https://streamlinehq.com

Determinants of Source-Language Response Variance

Identify and quantify the factors that contribute to response variance in source-language prompts for large language models on knowledge-intensive tasks, and characterize how these factors propagate to determine target-language variance and cross-lingual gaps.

Information Square Streamline Icon: https://streamlinehq.com

Background

The authors show that cross-lingual gaps diminish when source confidence (the probability of the modal response) is high and present theoretical and empirical links between source and target variance, implying that source-side factors are pivotal.

However, they explicitly state that the drivers of source-language variance are unknown and provide related observations (e.g., entity-centric gaps) without establishing causal factors.

References

The main paper argued that cross-lingual gaps are due to high variance in source, which also determines the variance in target. The factors contributing to variance in source are unclear.

Rethinking Cross-lingual Gaps from a Statistical Viewpoint (2510.15551 - Piratla et al., 17 Oct 2025) in Appendix: What determines the variance of responses? (Section: appendix:sbet)