Weighted Ensemble Milestoning
- WEM is a computational strategy that combines weighted ensemble sampling with milestoning to accurately compute rare-event kinetics and thermodynamics in complex molecular systems.
- It partitions configurational space into milestones and employs resampling to reduce simulation time while estimating MFPTs, rate constants, and free energy profiles.
- Extensions like WEM-RR and analytical corrections for diffusion enhance sampling diversity and adjust rates, making the approach valuable for drug discovery and biomolecular studies.
Weighted Ensemble Milestoning (WEM) is a computational strategy that synthesizes the milestoning approach with the weighted ensemble (WE) path sampling method to compute both thermodynamic and kinetic observables for rare event processes in complex molecular systems. By integrating the master-equation rigor of milestoning with the statistical efficiency of the WE algorithm, WEM achieves significant reductions in computational cost for simulating transitions that typically occur on timescales inaccessible to standard molecular dynamics (MD). This approach provides estimates for quantities such as mean first-passage times (MFPTs), committor probabilities, free energy profiles, rate constants, and time-correlation functions, with demonstrated applicability to both model systems and realistic biomolecular complexes (Ray et al., 2020, Ray et al., 2019).
1. Theoretical Framework
WEM partitions configurational space along a user-chosen reaction coordinate (RC) into a set of non-overlapping hypersurfaces referred to as milestones . Trajectories are initiated at each milestone and propagated until they reach a neighboring milestone. Two core statistical entities are accumulated:
- Milestone-to-milestone transition kernel : Probability that a trajectory starting on milestone first exits on neighboring milestone (),
where are statistical weights from WE, and is the set of trajectories starting on and first hitting .
- Mean lifetime at milestone , :
with the dwell time before escape.
The stationary distribution of fluxes is determined via:
subject to appropriate boundary conditions. The equilibrium occupancy at milestone is:
and the free energy profile referenced to milestone $0$ is:
The MFPT from reactant ($0$) to product () milestones follows:
where is the steady-state flux through the product milestone (Ray et al., 2020, Ray et al., 2019).
2. Algorithmic Workflow
WEM proceeds in the following main steps:
- Milestone anchoring: Save a representative configuration from initialization MD at each milestone .
- WE binning: Subdivide the segment between adjacent milestones into WE bins.
- WE iteration: Launch trajectories from , each with weight $1/N$. Propagate for a WE cycle duration under unbiased dynamics.
- Resampling: After each , retain a fixed number of trajectories in each bin by splitting and merging, conserving total bin weight.
- Milestone crossing accounting: If a trajectory exits to a neighboring milestone, record its exit statistics () and remove it.
- Assembly: Upon convergence, aggregate and for all adjacent milestone pairs, solve the master equation, and extract kinetic/thermodynamic observables.
This workflow is highly parallelizable, as WE sampling within each milestone segment is independent (Ray et al., 2020, Ray et al., 2019).
3. WEM-RR: Restraint-Release Extension
The WEM-Restraint-Release (WEM-RR) protocol addresses insufficient initial condition sampling from a single anchor per milestone in standard WEM. It imposes a harmonic restraint on the RC at each milestone for WE iterations to generate a set of distinct configurations . After discarding the restraint, full WE runs are started from each anchor, dramatically increasing sampling diversity without additional atomic-scale cost. This reduces variance in both the transition kernel and mean lifetime estimates .
Pseudocode for WEM-RR anchoring:
1 2 3 4 5 6 |
for each milestone i: restrain ξ(x) ≈ ξ_i; run L WE iterations → anchors {x_i^ℓ} for each ℓ in 1…L: initialize N trajectories at x_i^ℓ with w=1/(N L) run WE until K_{i,i±1}, T_i converge assemble global K, T → solve for kinetics & ΔG |
4. Analytical Corrections for Diffusion-Limited Binding
For association events dominated by diffusive arrival at a boundary milestone with a nearly flat free energy profile, WEM supports an analytical correction to the computed . This accounts for the effective fraction of the outer milestone's surface that leads inward (computed via Monte-Carlo sampling), the ligand diffusion coefficient , and the transition kernel element :
where is Avogadro's number. This correction enables accurate estimation of association rates in the diffusion-limited regime (Ray et al., 2020).
5. Benchmark Applications and Validation
WEM and WEM-RR have been validated on both model and biomolecular systems:
| System | Observables | Result accuracy (vs reference) |
|---|---|---|
| Na/Cl ion | Free energy barrier, | MFPT ≈ 1 ns (1.25 ns brute-force MD); |
| pair in water | MFPT, , | s, Ms, (within <1 of reference) |
| FKBP–BUT ligand | , MFPT, | Standard WEM: large variance; WEM-RR (): ns (cf. 7.23.0 ns MD), , Ms, mM, (closely matching long Anton MD and metadynamics) |
In all cases, WEM achieves order-of-magnitude improvements in computational effort compared to conventional or WE-only simulations, with final results in agreement with reference MD and, where available, experiment (Ray et al., 2020).
6. Implementation, Efficiency, and Applicability
WEM's computational steps can be summarized as follows:
- Milestone selection: Milestones should be placed orthogonally to the slow reaction coordinate; finer spacing in high free energy barrier regions is beneficial.
- WE bins: Within each milestone cell, 5–20 bins are typical; binning enables granular resampling needed for efficient WE.
- Convergence: is routinely used; stability in and needed before assembly.
Computational efficiency is high due to the embarrassingly parallel nature of individual WE runs between milestones. For example, FKBP–BUT dissociation calculations (both kinetics and thermodynamics) required less than 100 ns of aggregate simulation time across all milestones, in contrast to tens of μs for brute-force MD (Ray et al., 2020). No biasing forces are introduced, so physical mechanisms are preserved. Quantities accessible include committor probabilities, MFPTs, all rate constants (on/off), , free energy landscapes, and full time-correlation functions.
WEM and WEM-RR are especially suited to high-throughput sampling in drug discovery, screening, and any application requiring rare-event kinetics well beyond the microsecond scale, as long as an appropriate low-dimensional RC can be identified (Ray et al., 2020, Ray et al., 2019).
7. Relation to Other Enhanced Sampling Strategies
WEM is distinguished from classical milestoning by its use of WE to efficiently estimate transition statistics within the milestone formalism. The synergy enables accurate thermodynamics and kinetics in fewer total MD steps versus either method alone. The master-equation-based assembly of observables ensures quantitative rigor, while WE acceleration mitigates the sampling bottlenecks endemic to direct trajectory-based methods. In contrast to single-trajectory brute-force MD and equilibrium WE methods, WEM leverages both statistical resampling and partitioned configuration space to realize faster convergence and accessibility to both equilibrium and dynamical quantities (Ray et al., 2019).
References:
- [Kinetics and Free Energy of Ligand Dissociation Using Weighted Ensemble Milestoning, (Ray et al., 2020)]
- [Weighted Ensemble Milestoning (WEM): A Combined Approach for Rare Event Simulations, (Ray et al., 2019)]