Bayes Factor Stopping Regions
- Bayes factor stopping regions are defined by thresholds at which a sequential Bayesian test halts, providing an operational criterion for early stopping.
- They integrate decision-theoretic risk minimization and controlled Type I error rates by interpreting evidence in posterior odds.
- Their applications span clinical trials and experimental designs, with extensions to soft classification and continuous-time drift testing.
A Bayes factor stopping region is the set of all sample paths or statistic values for which the Bayes factor crosses pre-specified thresholds, at which point a sequential Bayesian test halts sampling and renders a terminal inference. This construction lays the mathematical and operational basis for Bayesian sequential analysis, providing an evidence-based criterion for early stopping that is interpretable in terms of posterior odds, decision-theoretic risk minimization, and (in special cases) frequentist error control. The theoretical and practical ramifications of these stopping regions have been the focus of significant analysis in sequential testing theory, optimal stopping, group-invariant models, clinical trial design, and the study of optional stopping in Bayesian inference.
1. Mathematical Definition and Construction
A Bayes factor stopping region is determined by the evolution of the (possibly sequentially updated) Bayes factor , which, at each sample size , compares the marginal likelihoods of all data observed so far under two competing hypotheses: where %%%%2%%%% is the model under , and is its prior (Heide et al., 2017, Hendriksen et al., 2018).
A Bayes factor stopping rule is any rule that stops sampling the first time such that
for constants (or, by symmetry, ). The corresponding Bayes factor stopping region is the set: with the continuation region being the complement (Heide et al., 2017, Hendriksen et al., 2018). At stopping, the decision is made in favor of if or if .
This structure generalizes to composite hypotheses with nuisance parameters and to the continuous-time setting, where (e.g. in the drift testing of Brownian motion) the Bayes factor is constructed from filtered posteriors over the unknown parameter, defining boundaries in the space of posterior probabilities or Bayes factors (Ekström et al., 2015, Campbell et al., 20 Jan 2025).
2. Bayes Factor Stopping Regions in Sequential Drift Testing
In the canonical Bayesian sequential test of a Brownian motion's drift,
the stopping region can be recast as an optimal stopping problem for the posterior probability process : with . In Bayes factor terms, this is equivalent to monitoring the process , where are the prior odds. The stopping boundaries (or equivalently, in the Bayes factor scale) are characterized by systems of integral equations in both finite- and infinite-horizon settings (Ekström et al., 2015).
The table below summarizes the principal components of Bayes factor stopping regions in Brownian motion drift testing:
| Element | Description | Reference |
|---|---|---|
| Signal process | (Ekström et al., 2015) | |
| Statistic | Posterior ; Bayes factor | (Ekström et al., 2015) |
| Stopping rule | Exit from interval | (Ekström et al., 2015) |
| Boundary equations | System of integral equations (finite/infinite) | (Ekström et al., 2015) |
These boundaries are monotonic in time (non-decreasing for , non-increasing for ) and are continuous functions of , with long-term asymptotics determined by the prior mass near the decision boundary (Ekström et al., 2015).
3. Theoretical Guarantees Under Optional Stopping
Bayes factor stopping regions possess several distinct mathematical guarantees in the context of optional stopping:
- Stopping-rule independence (-independence): For any stopping time , the posterior odds at stopping depend only on the observed data, not on the manner in which the stopping time is chosen, provided the prior is fixed before data collection (Heide et al., 2017, Hendriksen et al., 2018).
- Posterior calibration: The empirical frequency with which certain Bayes factor values arise under versus matches the nominal Bayes factor; for unit prior odds, (Heide et al., 2017, Hendriksen et al., 2018).
- Semi-frequentist Type I error control: Markov's inequality yields that, for any ,
so that exceeding the threshold at any time yields a maximal Type I error of (Hendriksen et al., 2018).
However, these guarantees can weaken if "default" or design-dependent priors are used, or when Bayesian updates are not those of a subjective Bayesian (see Section 4).
4. Impact of Prior Choice and Calibration Results
The efficacy and interpretation of Bayes factor stopping regions under optional stopping are sensitive to the nature of the employed priors (Heide et al., 2017). Three principal types are identified:
- Type 0 (Group-invariant/Haar) priors: For nuisance parameters with group structure (e.g., scale in normal families), right-Haar priors guarantee strong calibration under all stopping rules.
- Type I (Proper, non-invariant) priors: Default priors not tied to a group invariance (e.g., Cauchy on effect size) only ensure prior-based calibration; strong calibration may fail, and under optional stopping, significant miscalibration can arise at fixed parameter values.
- Type II (Design-dependent) priors: Priors depending on sample size, design, or sampling scheme (e.g., g-prior in regression) render calibration ill-defined under optional stopping.
This classification guides recommendations: optional stopping is valid for fully subjective Bayesians or for group-invariant nuisance calibration. Otherwise, the Bayes factor can severely misrepresent evidence when stopping is dictated by accumulating data, especially for parameters of scientific interest (Heide et al., 2017).
5. Explicit Boundary Characterizations and Algorithmic Implementation
In practical Bayesian sequential design—especially in clinical or experimental trials—the Bayes factor stopping region is often implemented as z-statistic boundaries across pre-scheduled (group-sequential) interim analyses. For common models, the log Bayes factor is a monotone function of the cumulative z-statistic at each look, allowing direct computation of critical values via root-finding:
For a fixed schedule of interim looks, one computes stopping ( or ) and continuation regions at each look , translating Bayes factor boundaries into operational criteria for practice (Pawel et al., 6 Jan 2026). Key operating characteristics—power, expected sample size, type I/II error rates—are computed via multivariate normal integrals over these stopping regions, avoiding simulation (Pawel et al., 6 Jan 2026).
6. Extensions: Soft Classification and Free-Boundary Problems
A recent generalization replaces the classical hard 0-1 loss in sequential drift testing with a soft classification penalty , inducing expanded continuation regions whose boundaries are characterized by free-boundary PDEs and analytic two-tangent conditions on convex envelopes (Campbell et al., 20 Jan 2025). The optimality region in posterior or Bayes factor space is determined by the shape of , the signal-to-noise ratio, and the observation cost, seamlessly extending optimal stopping theory to soft-decision and continuous-risk settings.
For instance, with (symmetric loss), the boundaries satisfy implicit equations tied to the joint effect of loss and information, demonstrating that as the signal strength increases, the region where continued sampling is optimal covers almost all posterior mass—matching the classical sequential probability ratio test in the limit (Campbell et al., 20 Jan 2025).
7. Practical and Conceptual Implications
Bayes factor stopping regions provide a principled, interpretable, and implementable criterion for sequential inference in both exploratory and confirmatory research. They connect Bayesian evidence monitoring with decision-theoretic risk minimization, permit frequentist-style error rate control in well-specified settings, and allow rapid design optimization in sequential and group-sequential trials. Nonetheless, the properties of Bayes factor stopping regions rest crucially on the choice and interpretation of the prior, the invariance structure of the model, and the measurability of stopping rules. Full calibration and error control require strict adherence to group-invariant prior constructions and stopping rules dependent only on maximal invariants (or the Bayes factor itself) (Heide et al., 2017, Hendriksen et al., 2018).
A plausible implication is that, in routine scientific use with default or pragmatic priors, Bayesian sequential designs using Bayes factor stopping rules may only provide reliable evidence quantification and error control in nuisance or exploratory settings, and care must be taken to avoid evidence overstatement in confirmatory inference with parameters of interest.
References:
(Ekström et al., 2015): Bayesian sequential testing of the drift of a Brownian motion (Heide et al., 2017): Why optional stopping can be a problem for Bayesians (Hendriksen et al., 2018): Optional Stopping with Bayes Factors: a categorization and extension of folklore results, with an application to invariant situations (Campbell et al., 20 Jan 2025): A Bayesian sequential soft classification problem for a Brownian motion's drift (Pawel et al., 6 Jan 2026): Bayes Factor Group Sequential Designs