
FALCON: Few-step Likelihoods in Continuous Flows

Updated 12 December 2025
  • The paper introduces a hybrid few-step flow map and training objective that accurately computes likelihoods with minimal ODE steps.
  • It achieves up to two orders of magnitude speedup over traditional CNFs while preserving the exact change-of-variables formulation.
  • Empirical evaluations on molecular systems confirm that FALCON maintains high sample quality and efficiency compared to existing methods.

Few-step Accurate Likelihoods for Continuous Flows (FALCON) is a methodology designed to address the computational inefficiencies inherent to likelihood evaluation and sampling in continuous normalizing flows (CNFs), particularly in the context of molecular Boltzmann sampling. FALCON introduces a hybrid training objective and a few-step flow map, enabling accurate likelihood computation with a minimal number of network evaluations, while maintaining the invertibility required for exact change-of-variables formulations. This approach yields substantial acceleration—up to two orders of magnitude in inference speed—over prior CNF architectures without sacrificing empirical sample quality or likelihood accuracy (Rehman et al., 10 Dec 2025).

1. Continuous Normalizing Flows and Exact Likelihoods

Continuous normalizing flows (CNFs) model transformations between probability distributions via the solution of an ordinary differential equation (ODE) parameterized by a neural vector field $f:\mathbb{R}^d \times [0,T] \rightarrow \mathbb{R}^d$. Given an initial condition $x(0) \sim p_0$, the ODE

$$\frac{dx}{dt} = f(x(t), t)$$

drives $x(t)$ from the base distribution $p_0$ to a target distribution $p_1$. The evolution of the log-density follows

$$\frac{d}{dt}\log p(x(t)) = -\operatorname{Tr}\left( \partial_x f(x(t), t) \right),$$

integrating to provide the exact likelihood:
$$\log p(x(T)) = \log p(x(0)) - \int_0^T \operatorname{Tr}(\partial_x f(x(t), t))\, dt.$$
In practice, evaluating the likelihood requires discretizing the ODE and estimating the Jacobian trace to high accuracy. State-of-the-art molecular Boltzmann generators demand tight integration tolerances (e.g., $\mathrm{atol}=\mathrm{rtol}=10^{-5}$), yielding hundreds to thousands of ODE steps and correspondingly expensive network function evaluations—$\mathcal{O}(dT)$ per sample (Rehman et al., 10 Dec 2025).
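To make the joint state/log-density integration concrete, here is a minimal sketch (not from the paper; a hypothetical one-dimensional example with linear field $f(x,t) = ax$, whose Jacobian trace is simply the constant $a$) that Euler-integrates both ODEs and checks against the closed form:

```python
import numpy as np

def f(x, t, a=0.5):
    """Toy linear vector field f(x, t) = a * x, so Tr(d f / d x) = a."""
    return a * x

def cnf_log_likelihood(x0, T=1.0, n_steps=1000, a=0.5):
    """Jointly Euler-integrate the state ODE and the log-density ODE.

    Returns x(T) and log p(x(T)) under a standard-normal base density p_0.
    For this linear field the exact answer is log p_0(x0) - a*T, which
    lets us sanity-check the discretization."""
    dt = T / n_steps
    x = x0
    logp = -0.5 * x0**2 - 0.5 * np.log(2 * np.pi)  # log p_0(x0) for N(0, 1)
    for i in range(n_steps):
        t = i * dt
        trace = a                     # Tr(partial_x f) is constant here
        x = x + dt * f(x, t, a)       # state update
        logp = logp - dt * trace      # d/dt log p = -Tr(partial_x f)
    return x, logp

x0 = 0.3
xT, logp = cnf_log_likelihood(x0)
exact = -0.5 * x0**2 - 0.5 * np.log(2 * np.pi) - 0.5 * 1.0
print(abs(logp - exact) < 1e-6)  # → True
```

A learned vector field would replace `f`, the trace would come from automatic differentiation (or a Hutchinson estimator), and an adaptive solver such as Dormand–Prince would replace the fixed Euler loop; the toy keeps every piece analytic.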

2. Flow-Matching Training and Inference Bottlenecks

The standard flow-matching objective, as formalized by [Lipman et al. 2022], leverages a sampling procedure where $x_0 \sim p_0$, $x_1 \sim p_1$, and $s \sim \mathrm{Unif}[0,1]$ to create linear interpolants $x_s = s x_1 + (1-s) x_0$. The model vector field $v_\theta(x_s, s)$ is trained by minimizing

$$\mathcal{L}_{\mathrm{cfm}}(\theta) = \mathbb{E}_{x_0, x_1, s} \left[ w(s) \|v_\theta(x_s, s) - (x_1 - x_0)\|_2^2 \right].$$

Although this objective sidesteps maximum-likelihood estimation during training, inference remains bottlenecked: evaluating likelihoods and log-density corrections still requires fine-grained ODE integration, since the standard likelihood computation is path-dependent.
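The objective above can be sketched in a few lines (a hypothetical NumPy version, not the paper's code). For matched pairs, the regression target for the linear interpolant is the constant displacement $x_1 - x_0$, so an oracle model that returns it achieves zero loss:

```python
import numpy as np

rng = np.random.default_rng(0)

def cfm_loss(v_theta, x0, x1, s, w=None):
    """Monte Carlo estimate of the conditional flow-matching loss.

    v_theta(xs, s) -> predicted velocity; the target along the linear
    interpolant x_s = s*x1 + (1-s)*x0 is x1 - x0."""
    xs = s[:, None] * x1 + (1 - s)[:, None] * x0
    target = x1 - x0
    sq = np.sum((v_theta(xs, s) - target) ** 2, axis=1)
    if w is not None:
        sq = w(s) * sq       # optional time-dependent weighting w(s)
    return sq.mean()

# Sanity check with an oracle that looks up the true conditional target
# (illustration only; a real model sees only (xs, s)).
x0 = rng.standard_normal((128, 2))
x1 = rng.standard_normal((128, 2)) + 3.0
s = rng.uniform(size=128)
oracle = lambda xs, t: x1 - x0
print(cfm_loss(oracle, x0, x1, s))  # → 0.0
```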

3. FALCON: Hybrid Few-Step Flow Map and Training Objective

FALCON introduces a discrete-time, few-step "flow map" $X_u$:
$$X_u(x_s, s, t) = x_s + (t - s)\, u_\theta(x_s, s, t),$$
where $u_\theta$ is trained to approximate the integrated vector field of the underlying ODE over $(s, t)$. This construction is accompanied by a hybrid loss:
$$\mathcal{L}(\theta) = \mathcal{L}_{\mathrm{cfm}}(\theta) + \lambda_{\mathrm{avg}} \mathcal{L}_{\mathrm{avg}}(\theta) + \lambda_r \mathcal{L}_{\mathrm{inv}}(\theta),$$
with

  • $\mathcal{L}_{\mathrm{cfm}}$: Standard flow-matching regression.
  • $\mathcal{L}_{\mathrm{avg}}$: Average-velocity matching to enforce $u_\theta$ as an accurate mean flow (MeanFlow-style, [Geng et al. 2025]), relating $u_\theta$ to the true time-averaged ODE velocity.
  • $\mathcal{L}_{\mathrm{inv}}$: Cycle-consistency regularizer promoting invertibility by minimizing the expectation

$$\mathbb{E}_{s, t, x_s} \left\| x_s - X_u(X_u(x_s, s, t), t, s) \right\|^2.$$

Hyperparameters $\lambda_{\mathrm{avg}}$ and $\lambda_r$ control the balance between generation accuracy and invertibility.
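To make the flow map and the cycle-consistency term concrete, here is a minimal numerical sketch (hypothetical, not the paper's implementation). It uses the closed-form average velocity of a linear ODE $dx/dt = ax$, for which the flow map is exactly invertible and the cycle loss vanishes:

```python
import numpy as np

def flow_map(u, x, s, t):
    """FALCON-style flow map X_u(x, s, t) = x + (t - s) * u(x, s, t)."""
    return x + (t - s) * u(x, s, t)

def cycle_loss(u, x, s, t):
    """Cycle-consistency regularizer: map s -> t, then t -> s, compare to x."""
    x_fwd = flow_map(u, x, s, t)
    x_rec = flow_map(u, x_fwd, t, s)
    return np.mean(np.sum((x - x_rec) ** 2, axis=-1))

# Exact time-averaged velocity for dx/dt = a*x over [s, t]:
# u*(x, s, t) = x * (exp(a (t - s)) - 1) / (t - s),
# whose flow map is x -> x * exp(a (t - s)), invertible by swapping s and t.
a = 0.7
u_star = lambda x, s, t: x * np.expm1(a * (t - s)) / (t - s)

x = np.random.default_rng(1).standard_normal((64, 3))
print(cycle_loss(u_star, x, 0.0, 1.0))  # ~0: forward and backward maps cancel
```

In training, `u` would be a neural network evaluated at sampled $(s, t)$ pairs, and this term would be added to the flow-matching and average-velocity losses with weight $\lambda_r$.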

4. Few-Step Sampling and Likelihood Computation

Sampling and likelihood calculation within FALCON proceed as a sequence of updates along a user-defined schedule $0 = t_0 < t_1 < \dots < t_N = 1$, typically with $N = 4$–$16$. At each step:

  • The particle state is updated via the flow map:

$$x_i = X_u(x_{i-1}, t_{i-1}, t_i) = x_{i-1} + (t_i - t_{i-1})\, u_\theta(x_{i-1}, t_{i-1}, t_i).$$

  • The log-density is tracked by

$$\ell_i = \ell_{i-1} - \log \left| \det \left( \partial_{x_{i-1}} X_u(x_{i-1}, t_{i-1}, t_i) \right) \right|.$$

This procedure is $\mathcal{O}(Nd)$ in cost for network evaluations, with $N$ as small as 4 yielding likelihoods accurate enough for self-normalized importance sampling (SNIS). Unlike CNFs, the expensive continuous trajectory integration and large numbers of function evaluations are circumvented (Rehman et al., 10 Dec 2025).

| Step | Operation | Notes |
|------|-----------|-------|
| (i) | $x_0 \sim p_0$, $\ell_0 = \log p_0(x_0)$ | Initial sample |
| (ii) | $x_i = X_u(\cdot)$, $\ell_i = \ell_{i-1} - \log\lvert\det(\cdot)\rvert$ | Iterative flow update and log-det correction |
| (iii) | Output $(x_N, \ell_N)$ | Final sample and likelihood |
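The three steps above can be sketched end-to-end on a toy problem (hypothetical, not the paper's code): a linear ODE whose average velocity and Jacobian are known in closed form, so the log-det correction is $d \cdot a \cdot (t - s)$ per step. A learned $u_\theta$ would instead obtain the Jacobian by automatic differentiation:

```python
import numpy as np

def u_star(x, s, t, a=0.5):
    """Exact average velocity for the toy linear ODE dx/dt = a*x."""
    return x * np.expm1(a * (t - s)) / (t - s)

def falcon_sample(x0, schedule, a=0.5):
    """Few-step sampling with a running log-density (steps i-iii above).

    The Jacobian of this toy flow map is exp(a (t - s)) * I, so the
    log-det correction is d * a * (t - s) in closed form."""
    d = x0.shape[-1]
    x = x0
    logp = -0.5 * np.sum(x0**2) - 0.5 * d * np.log(2 * np.pi)  # log p_0(x0)
    for s, t in zip(schedule[:-1], schedule[1:]):
        x = x + (t - s) * u_star(x, s, t, a)   # (ii) flow-map update
        logp -= d * a * (t - s)                # (ii) -log|det d X_u / d x|
    return x, logp                             # (iii) sample and likelihood

x0 = np.array([0.2, -0.4])                     # (i) draw from p_0 in practice
schedule = np.linspace(0.0, 1.0, 5)            # N = 4 steps
xN, logp = falcon_sample(x0, schedule)
exact = -0.5 * np.sum(x0**2) - np.log(2 * np.pi) - 2 * 0.5 * 1.0
print(np.allclose(logp, exact))  # → True
```

Because the toy average velocity is exact, the four-step likelihood matches the continuous-time answer; for a trained model the gap is the few-step discretization error the hybrid loss is designed to shrink.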

5. Theoretical Guarantees

FALCON is accompanied by two central theoretical propositions [(Rehman et al., 10 Dec 2025), Appendix A.1–A.2]:

  • Proposition 1 (Average-Velocity Optimality): If $u_\theta$ exactly minimizes the mean-velocity loss $\mathcal{L}_{\mathrm{avg}}$, then $X_u$ replicates the exact time-$[s,t]$ ODE flow map and is globally invertible. The discrete change-of-variables formula (log-density update) then holds exactly.
  • Proposition 2 (Invertibility Regularizer): Minimizing $\mathcal{L}_{\mathrm{inv}}$ alone is sufficient to guarantee that $X_u(\cdot, s, t)$ is invertible almost everywhere, ensuring the validity of the log-determinant correction in discrete likelihood calculations.

No explicit analytic error bounds as a function of the number of steps $N$ are given, but empirical results indicate that increasing $N$ rapidly reduces discretization error, and good accuracy is achieved with small $N$.

6. Empirical Evaluation on Molecular Boltzmann Sampling

FALCON is evaluated on a suite of molecular systems under implicit solvent Amber force fields:

  • Alanine dipeptide (ALDP)
  • Tri-alanine (AL3)
  • Alanine tetrapeptide (AL4)
  • Hexa-alanine (AL6)

Baselines include discrete normalizing flows (SE(3)-EACF, RegFlow, SBG) and continuous flows (ECNF, ECNF++, BoltzNCE). Performance metrics are:

  • Effective Sample Size (ESS)
  • $2$-Wasserstein distance on energy histograms (E–$\mathcal{W}_2$)
  • Torus $2$-Wasserstein distance on dihedral angles (T–$\mathcal{W}_2$)
  • Wall-clock inference time and network function evaluations (NFE)
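For reference, the effective sample size of self-normalized importance sampling can be computed from log importance weights; the sketch below uses one common normalized definition, $\mathrm{ESS} = (\sum_i w_i)^2 / (n \sum_i w_i^2)$, which may differ from the paper's exact convention:

```python
import numpy as np

def snis_ess(log_weights):
    """Normalized ESS for self-normalized importance sampling.

    Computed after subtracting the max log-weight for numerical stability;
    returns a value in (0, 1]."""
    lw = log_weights - np.max(log_weights)
    w = np.exp(lw)
    return w.sum() ** 2 / (len(w) * (w ** 2).sum())

# Uniform weights give ESS = 1; a single dominant weight drives ESS
# toward 1/n, signaling a poor proposal.
print(snis_ess(np.zeros(100)))                        # → 1.0
print(snis_ess(np.array([10.0] + [0.0] * 99)) < 0.02)  # → True
```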

Results demonstrate that, for ALDP, FALCON achieves an ESS of $0.225$ (comparable to ECNF++'s $0.275$), with improved $\mathcal{W}_2$ distances (FALCON: $0.402$ vs. ECNF++: $0.914$; SBG: $0.873$). For larger systems (AL3/AL4/AL6), FALCON achieves ESS up to $\sim 0.1$ versus ECNF++ at $< 0.02$, with lower Wasserstein distances. Inference time is improved by $100\times$, with FALCON requiring only $4$–$16$ steps as opposed to $200$–$300$ for Dormand–Prince CNFs (Rehman et al., 10 Dec 2025).

7. Practical Limitations and Future Prospects

FALCON's discretization error is empirically calibrated; formal coverage of error versus step count $N$ is not provided. The approach does not yet achieve the one-step limit, with best results at $N = 4$–$8$. While invertibility is empirically satisfied (empirical $\ell_2$ reconstruction error $\approx 10^{-4}$), it is not a strict constraint during training and relies on convergence of the cycle-consistency loss. Future research directions highlighted include:

  • Structured-Jacobian architectures to further reduce the cost of Jacobian determinant evaluation,
  • Application to Bayesian inference, robotics, and complex posteriors,
  • Theoretical quantification of few-step discretization error.

FALCON unifies simulation-free flow matching with a fast, invertible few-step mapping, providing efficient and accurate likelihoods for importance sampling and likelihood-based downstream tasks in domains where CNF inference costs were previously prohibitive (Rehman et al., 10 Dec 2025).
