Dice Question Streamline Icon: https://streamlinehq.com

Quantify the scaling threshold needed to recover asymptotic properties with finite MC samples

Determine the minimal scaling of data and/or Monte Carlo sample sizes required to restore the asymptotic validity of maximum-likelihood estimators and the profile-likelihood ratio—yielding correct coverage—when using the full Barlow–Beeston likelihood for high-statistics binned analyses with nuisance parameters and finite-size Monte Carlo templates. The goal is to ascertain quantitatively how large the scaling of data and/or simulation must be so that Wilks’ theorem applies and Hessian- or PLR-based confidence intervals achieve the nominal coverage.

Information Square Streamline Icon: https://streamlinehq.com

Background

The paper demonstrates, via a paradigmatic toy model, that in high-statistics binned analyses where the model is built from finite Monte Carlo samples and includes nuisance parameters, confidence intervals based on asymptotic properties (Hessian or profile-likelihood ratio) can systematically under-cover. The authors show numerically that increasing the size of data and/or simulation eventually restores the expected asymptotic behavior, but the required scale is not straightforward to determine.

This uncertainty arises despite large event counts per bin and many bins, indicating that the conventional indicators of asymptotics (e.g., large N) are insufficient when MC template fluctuations and nuisance parameter profiling are present. A clear quantitative criterion for the necessary scaling would guide practitioners in ensuring correct coverage without resorting to impractically large simulations.

References

We found that the asymptotic properties can eventually be recovered by applying a sufficiently large scaling of the data and/or simulation sizes, but we could not find a clear indication of which scale should be considered sufficiently "large".

Under-coverage in high-statistics counting experiments with finite MC samples (2401.10542 - Alexe et al., 19 Jan 2024) in Discussion, Section 2