NETI: Non-Equilibrium Thermodynamic Integration
- NETI is a computational technique that estimates Bayes factors by variationally annealing between Bayesian posteriors to reduce estimator variance.
- It applies non-equilibrium statistical mechanics principles to bypass the high-variance prior regimes inherent in standard thermodynamic integration.
- NETI achieves significant variance reduction in nested model comparisons through fine-grained discretization and minimized relaxation errors.
Non-Equilibrium Thermodynamic Integration (NETI) is a computational methodology for estimating Bayes factors via marginal likelihood ratios between Bayesian models, with a focus on minimizing estimator variance and discretization error. NETI variationally anneals between posterior distributions by leveraging non-equilibrium statistical mechanics principles, systematically circumventing the high-variance regimes associated with conventional prior-to-posterior thermodynamic integration (TI). This approach yields significant variance reduction when models share parameters, particularly in nested model comparison scenarios (Grzegorczyk et al., 2017).
1. Background: Thermodynamic Integration for Marginal Likelihoods
Thermodynamic integration (TI) is a standard approach to estimate the marginal likelihood for a model given data , where
with as the parameter vector. TI constructs a family of tempered "power posteriors":
with normalization such that and . The log marginal likelihood decomposes as
numerically integrated by discretizing and estimating expectations with MCMC.
A major limitation of TI is the "prior regime" for near zero, where is dominated by the prior. The Monte Carlo approximation of in this regime is highly variable, especially when the likelihood is diffuse under the prior or has high-dimensional parameter space. This can dominate total estimator variance, requiring impractically fine temperature grids or large MCMC samples for reliable estimation (Grzegorczyk et al., 2017).
2. Direct-Path TI for Model Comparison
For hypothesis testing or model comparison, the target is the Bayes factor . The direct-path TI method instead defines an annealing path between the posterior of and :
with a joint prior marginalizing to the individual model priors. The path’s partition function is
One obtains
Integrating yields
where . This path systematically avoids the problematic prior regime inherent in standard TI (Grzegorczyk et al., 2017).
3. Non-Equilibrium Thermodynamic Integration Framework
The non-equilibrium TI (NETI) framework adapts statistical mechanical concepts, in particular Jarzynski's equality, to estimate normalizing constant ratios:
where is the accumulated "work" along a non-equilibrium protocol as evolves from $0$ to $1$. In practice, a single long MCMC trajectory is performed, adiabatically updating in fine-grained steps , at each recording . The continuous path integral
is discretized as
with large. Discretization error scales as , but becomes negligible as increases (typically as many temperature steps as total MCMC iterations). The dominant remaining error component, the "relaxation error," diminishes as , versus the scaling of standard TI (Grzegorczyk et al., 2017).
4. NETI-DIFF Algorithmic Implementation
The NETI-DIFF algorithm proceeds as follows. For a schedule (power-law or sigmoid, as appropriate), the procedure:
- Initialize by sampling from .
- For to :
- Set .
- Perform MCMC update(s) targeting .
- Record .
- Advance .
- Compute the estimator:
This approach leverages non-equilibrium integration, drastically increasing temperature resolution without significant computational overhead since each is updated only briefly at each (Grzegorczyk et al., 2017).
5. Variance Reduction Theoretical Results
Let and respectively denote variance for standard TI and NETI-DIFF estimators. Under mild regularity conditions, when models share parameters,
with a prefactor approximately reduced by , while
with no reduction. This indicates orders-of-magnitude variance reduction for NETI-DIFF in high-overlap (e.g., nested) model scenarios (Grzegorczyk et al., 2017). A plausible implication is that NETI-DIFF particularly excels in Bayesian model selection tasks featuring nested or similar model parametrizations.
6. Empirical Evaluations and Benchmarks
Empirical assessment compared standard TI (trapezoidal), TI with Friel & Pettitt corrections, and NETI-DIFF on the following benchmarks:
- Radiata pine: Linear, , non-nested regressions, closed-form .
- Pima Indians: Logistic, , nested regressions, gold standard.
- Radiocarbon: Polynomial Bayesian linear regression (orders up to 10), closed-form BF.
- Biopathway: Nonlinear hierarchical ODE model, network inference, surrogate gold standard.
Performance metrics include average absolute error and variance , with ranging to and between 10 and 200. Principal findings:
- Radiata pine (non-nested): No significant difference, NETI TI.
- Pima Indians (nested): NETI reduced and by factors of 5–50.
- Radiocarbon: NETI reduced up to for large model differences.
- Biopathway: NETI reduced by one to two orders of magnitude; improved network-selection accuracy (Grzegorczyk et al., 2017).
7. Practical Guidelines for NETI-DIFF
- Path design: Use a power-law schedule () for nested models, denser near ; apply symmetric sigmoid for non-nested comparisons to mitigate end-bias.
- Number of steps: Set number of temperature steps equal to MCMC iterations; NETI-DIFF obviates need for full equilibrium at each .
- Computation: Similar per-iteration cost to TI; in some cases slightly less for NETI-DIFF due to holding shared parameters constant at .
- Variance reduction strategies: Combine with control variate techniques (CTI) by applying variance-reduction corrections to after path construction.
- Overall effect: NETI-DIFF replaces the dual prior-to-posterior integration with a single posterior-to-posterior integration, completely bypassing high-variance prior regimes and exploiting fine-grained non-equilibrium annealing schedules for substantial variance reductions in appropriate model-comparison settings (Grzegorczyk et al., 2017).