Dice Question Streamline Icon: https://streamlinehq.com

Analysis of Mallows model averaging with non-unique optimal mixtures

Analyze coverage guarantees and weight convergence for Algorithm 1 when Mallows model averaging is applied and the population prediction loss \(\mathcal{C}_\infty(\mathbf w)\) has multiple minimizers on the simplex, leading to non-unique optimal mixes and potentially random limiting weights.

Information Square Streamline Icon: https://streamlinehq.com

Background

The paper proves n\sqrt{n}-rate convergence of Mallows weights under standard time-series conditions when either a single model uniquely minimizes the population KLIC or a unique mixture minimizes the population prediction loss.

However, when several mixtures yield the same population risk, the limiting weights need not be unique and may remain random. The authors explicitly leave the theoretical analysis of the algorithm in this non-unique minimizer case for future work.

References

These two lemmas do not cover the case where the population loss has multiple minimizers, i.e., when different model combinations yield the same population risk. In that case, the estimated weights may remain random asymptotically. Analysis of the algorithm in this setting is left for future work.

Prediction Intervals for Model Averaging (2510.16224 - Qu et al., 17 Oct 2025) in Section 4.3 (Mallows criterion)