Intersection Supermartingale Products
- Intersection supermartingale products are rigorous probabilistic constructions that combine single-stratum nonnegative supermartingales to test composite hypotheses under controlled Type I error.
- They leverage adaptive sampling and predictable stratum selection to yield anytime-valid P-values, substantially increasing efficiency in risk-limiting audits.
- Empirical results show that these methods can significantly reduce sample size and risk, outperforming traditional approaches like SUITE and SHANGRLA.
Intersection supermartingale products are a rigorous probabilistic apparatus combining single-stratum nonnegative supermartingale (NNSM) processes to enable composite union-intersection hypothesis tests, particularly relevant in stratified risk-limiting audits (RLAs). This construction controls Type I error guarantees nonparametrically, applies under any predictable sampling strategy, and supports adaptive allocation of sampling resources across strata. Intersection supermartingale products yield anytime-valid -values and can substantially increase efficiency in practical RLA settings, offering improvements over earlier approaches such as SUITE and SHANGRLA by exploiting the generality and flexibility of NNSMs (Spertus et al., 2022).
1. Single-Stratum Nonnegative Supermartingales
The fundamental component is the single-stratum NNSM, defined for a stratum of size with bounded nonnegative assorter values . To test the one-sided null hypothesis for some , samples are drawn sequentially. At each sample, a predictable, typically upward-biased estimator of the mean is used to define a nonnegative factor satisfying under 0.
The single-stratum NNSM process is then
1
Ville’s inequality ensures 2 under 3 for any 4, granting finite-sample error control and anytime validity.
2. Intersection Supermartingale Construction
For testing the global null hypothesis 5 for all strata 6, intersection supermartingale products combine the single-stratum NNSMs into a global process. At each sampling epoch 7, a (predictable) stratum selector 8 identifies the stratum to be sampled, measurable with respect to the filtration 9 generated by the previous samples and stratum selections. Let 0 be the total draws from stratum 1 up to time 2.
The update at time 3 is:
4
where 5 is the new sample from stratum 6. The global intersection supermartingale is
7
By construction, 8 under 9, and Ville's inequality immediately applies:
0
Therefore, the first crossing time 1 such that 2 achieves strong control of the familywise Type I error for the intersection hypothesis.
3. Adaptive Stratum Selection and Efficiency
The validity of the global process for any predictable sampling rule enables adaptive allocation. Two canonical strategies are:
- Proportional Allocation: Sampling strata in proportion to their sizes (3), a nonadaptive but valid rule.
- Lower-Sided Screening (Hard Stop): In parallel, a one-sided test is run for each stratum for the alternative 4, using reversed bets. When this lower-sided supermartingale exceeds 5, sampling for that stratum ceases; subsequent 6 selections are restricted to remaining strata.
The lower-sided screening approach can reduce expected sample size by up to approximately 7 in illustrative two-stratum cases. Once a stratum ceases to provide evidence for 8, it is efficiently excluded from further sampling (Spertus et al., 2022).
4. Empirical Results and Case Studies
Application to both real-world and simulated audits demonstrates marked efficiency gains. In the 2018 Whitmer–Schuette race pilot audit in Kalamazoo, MI (two strata: CVR 9, no-CVR 0), the SUITE approach yielded a 1-value of 2. Re-analysis with intersection supermartingales reduced this to approximately 3, representing a more than order-of-magnitude reduction in measured risk, even after conditioning on observed samples.
In a large-scale simulation across 58 California counties (total 4), employing countywise Empirical-Bernstein supermartingales combined by the intersection product, sampling proportional to county turnout and stopping at 5 with 6, 7 of 8 replicates completed after 9 draws. The global 0-value computation, requiring a small LP (since the EB log-1 is linear in county means), took approximately 2 seconds per instance on commodity hardware (Spertus et al., 2022).
5. Practical Algorithm and Implementation
The intersection supermartingale audit can be expressed concisely as:
1
The function compute_factor is required to ensure 3 under 4, guaranteeing the supermartingale property. For numerical safety, truncation of 5-factors is allowed provided it only decreases the value, preserving the error control.
A valid omnibus 6-value at time 7 is 8, forming an anytime-valid 9-process.
6. Theoretical Guarantees and Connections
Intersection supermartingale products deliver Type I error control for global intersection nulls, generalizing classical sequential tests. The supermartingale property is robust to arbitrary predictable sampling rules, encompassing both non-adaptive and adaptive stratum selection. This universal validity supports optimal resource allocation in practical RLAs and ensures computational tractability even in high-dimensional stratification (e.g., many counties).
The intersection product structure provides a rigorous foundation for union–intersection testing in stratified designs and generalizes to any family of one-sided hypotheses with stratumwise NNSMs. The compatibility with Ville’s inequality and anytime 0-value construction supports efficient, sequential evidence accumulation. A plausible implication is that this approach could transfer to other multi-group sequential analysis tasks in statistics or machine learning involving stratified or dependent data (Spertus et al., 2022).
7. Related Methodologies and Generalizations
Intersection supermartingale products substantially synergize techniques from SHANGRLA, SUITE, and ALPHA. They inherit the computational tractability and modularity of SHANGRLA’s reduction to mean testing, the union-intersection logic of SUITE, and the nonparametric flexibility of ALPHA’s NNSM-based construction. Key references include Stark (2022) on intersection supermartingales for union–intersection tests, Waudby‐Smith et al. (2021) and Stark (2020) on betting martingales and the SHANGRLA framework.
The underlying principles are not reliant on specifics of ballot sampling and are extensible to other contexts—anywhere nonnegative supermartingale tests for intersection nulls, stratified adaptivity, or anytime error control are vital. This suggests broad applicability wherever composite hypotheses are tested under sequential, adaptive, or stratified sample collection regimes.
References:
- "Sweeter than SUITE: Supermartingale Stratified Union-Intersection Tests of Elections" (Spertus et al., 2022)
- Stark (2022), “Intersection supermartingales for union–intersection tests,” Stat Sci.
- Waudby-Smith et al. (2021), “Betting martingales for RLAs.”
- Stark (2020), “SHANGRLA: A framework for RLAs.”