Nakajima-Mori-Zwanzig (NMZ) Formulation

Updated 14 January 2026

Nakajima–Mori–Zwanzig formulation is a projection-operator framework that partitions observables into resolved and unresolved components, yielding exact evolution equations with memory effects.
It employs a memory kernel and fluctuating force to rigorously capture non-Markovian behavior in classical, quantum, and stochastic settings.
The approach underpins applications from turbulence modeling to quantum dynamics and deep learning, emphasizing systematic approximations and error control.

The Nakajima–Mori–Zwanzig (NMZ) Formulation is a mathematically rigorous framework for systematically reducing the dimensionality of complex dynamical systems, both classical and quantum, while rigorously capturing non-Markovian memory effects and noise. It is based on a projection-operator approach that splits the space of observables or states into “relevant” (resolved) and “irrelevant” (unresolved) components, yielding a hierarchy of exact or systematically approximated evolution equations for the resolved variables. The formalism originates in non-equilibrium statistical mechanics but has found diverse applications in fluid turbulence, stochastic differential equations, open quantum systems, climate delay modeling, uncertainty quantification, and, more recently, operator-theoretic analysis of machine learning.

1. Mathematical Structure and Operator Identities

The NMZ formalism begins with a high-dimensional deterministic or stochastic dynamical system, typically represented either as a set of ODEs/DDEs, partial differential equations, or Liouville/von Neumann equations for probability densities or quantum density operators. The central technical device is the introduction of a projection operator $P$ on the (Banach or Hilbert) space of observables, with its complement $Q=I-P$ . $P$ projects any observable/function/state onto the “resolved” subspace—e.g., a set of selected slow variables, low-frequency Fourier modes, or subsystem operators—while $Q$ projects onto the unresolved degrees of freedom.

Crucially, the NMZ identity (generalized Langevin equation, GLE) for the evolution of the resolved observables $A$ takes the form: $\frac{d}{dt} P\,A(t) = P\,L\,P\,A(t) + \int_0^t K(t-s) P\,A(s)\,ds + F(t)$ where:

$L$ is the Liouville (or Koopman/von Neumann) generator,
$K(t)$ is the memory kernel $K(t) = P\,L\,e^{t QL} QL P$ ,
$F(t)$ is the fluctuating force $F(t) = e^{t QL} QL (I-P)A(0)$ , orthogonal to the resolved subspace,
The memory term contains convolution with the past trajectory of the resolved variables and encodes non-Markovian effects (Parish et al., 2016, Dominy et al., 2016, Widder et al., 26 Mar 2025, Zhu et al., 2017).

This structure holds for autonomous and non-autonomous systems, as well as for discrete-time maps with propagator $U$ and the analogous kernel built from $PU(Q U)^\ell P$ in the discrete Mori–Zwanzig identity (Woodward et al., 2023).

In operator-algebraic terms, $P$ must be a conditional expectation onto a subalgebra for mathematical consistency and physical state preservation. This result establishes a duality between the NMZ equations for observables (Heisenberg picture) and for states (Schrödinger picture) (Dominy et al., 2016).

2. Projection Operators, Conditional Expectations, and Physical Interpretation

Rigorous developments show that physically relevant projections $P$ in the NMZ formalism are conditional expectations in the sense of operator algebras: they are idempotent, positive, and preserve the subalgebra of relevant observables. In concrete settings:

For classical systems, $P$ may be the conditional expectation over the unresolved variables, $Pf(\hat x, \tilde x) = E[f(\hat x, \tilde x)|\hat x]$ (Chorin et al., 2013).
For quantum systems, $P$ can be the partial trace over the environment, projecting operators to the subsystem algebra (Dominy et al., 2016).
In stochastic coarse-graining, $P$ averages over quasi-stationary distributions in a Markov state (Aristoff et al., 2023). Such projections are essential to guarantee completely positive and physically consistent reduced dynamics, especially in the quantum context (Dominy et al., 2016, Wilkie et al., 2011).

The memory term in NMZ is intimately tied to the nontrivial dynamical coupling between resolved and unresolved degrees of freedom. The fluctuating force $F(t)$ , which vanishes in expectation but is generally non-zero in each realization, encodes the uncertainty and stochasticity induced by unresolved modes. Generally, $F(t)$ is non-Markovian and non-Gaussian for nonlinear dynamics (Chorin et al., 2013).

3. Memory Kernels, Finite-Memory Approximations, and Model Reduction

Exact evaluation of the memory kernel $K(t)$ is intractable for nonlinear, high-dimensional dynamics. Several systematic approximations have been developed:

t-model/Fast memory: A leading-order (in time) expansion, replacing the convolution by $t P L Q L A(t)$ (Parish et al., 2016, Zhu et al., 2017).
Finite-Memory Models: Introduce auxiliary memory modes $w_j^{(m)}(t)$ corresponding to higher-order time derivatives, each satisfying a closed ODE with exponential decay rates. These allow recasting the non-Markovian system as a Markovian system in an expanded variable space, with memory lengths informed by the spectrum of the resolved Jacobian or decay rates of the orthogonal dynamics (Stinis, 2012, Parish et al., 2016, Parish et al., 2016).
Dynamic Memory Length: In fluid turbulence, the memory support $\tau_0$ is set heuristically as $\tau_0 \simeq C/\rho(\partial R / \partial \hat\phi)$ , with $C\approx 0.2$ fitted from Burgers equation (Parish et al., 2016).
Data-driven and Nonparametric Learning: Empirical estimation of memory kernels from time series is possible via regression on time-lagged correlation matrices, least-squares fits, or nonparametric estimation in Markov renewal coarse-graining (Lin et al., 2021, Aristoff et al., 2023, Woodward et al., 2023).
Hierarchical Approximations and Error Bounds: Rigorous a priori estimates and convergence results for truncations and finite-memory approximations are available, bounding the modeling error as a function of kernel decay and the chosen approximation order (Zhu et al., 2017).

Such techniques have proven crucial for obtaining tractable, yet accurate, surrogates for high-dimensional systems while retaining non-Markovian and noise contributions derived from first principles.

4. Applications in Classical, Quantum, and Data-Driven Contexts

The NMZ formulation has been exploited across a range of applications:

Turbulence and Large Eddy Simulation (LES): NMZ-derived subgrid closures, both t-model and finite-memory, systematically produce nonlocal closures whose kernel structure and time scales are dictated by the coarse-graining, with the final closure equation directly descended from the mathematics of projection (Parish et al., 2016, Parish et al., 2016). No ad hoc eddy viscosity ansatz appears, and the structure is closely related to the filtered dynamical system's numerical resolution.
Stochastic Processes and Coarse-Graining: In Markov processes, NMZ recasts the coarse dynamics as a Markov renewal process with memory kernels corresponding to jump-time distributions, facilitating arbitrarily accurate nonparametric coarse models when standard MSMs fail (Aristoff et al., 2023).
Hydrodynamics and Statistical Mechanics: NMZ provides the theoretical underpinning for transport theory, e.g., reproducing hydrodynamic correlation functions and Brillouin–Rayleigh spectral structure, yielding explicit predictions for memory-induced critical slowing down near QCD critical points (Kadam, 2022).
Delay Equations in Climate Modeling: NMZ gives principled derivation of nonlinear delay models for phenomena like ENSO, with memory integrals analyzed via orthogonal dynamics or approximate POD methods, producing physically justified delay terms where ad hoc phenomenological arguments were previously used (Falkena et al., 2019).
Quantum Non-Markovian Dynamics: NMZ master equations and their memory kernels underpin both Redfield-type and NIBA-type quantum master equations, delivering inhomogeneous, possibly non-completely-positive reduced state equations. Techniques to optimize projection operators and ensure positivity under mean-field closure have been established (Montoya-Castillo et al., 2016, Wilkie et al., 2011). Direct comparisons with time-convolutionless (TCL) approaches show that the NZ kernel structure can encode crucial dissipative channels absent from TCL (Smirne et al., 2010).
Deep Learning and Operator Theory: Recent operator-theoretic interpretations of deep neural networks (e.g., the Mori–Zwanzig formulation of deep learning) reveal that NMZ memory kernels correspond to the effective “memory” of the network and produce exact recurrence relations for propagating functional expectations through layers, with memory decay conditions essential for network reduction (Venturi et al., 2022).
Reduced Order and Data-Driven Modal Decomposition: NMZ-based mode decomposition (MZMD) generalizes DMD by incorporating memory terms and is empirically more robust and accurate for high-dimensional nonlinear systems whose observables do not form a Koopman-invariant subspace (Woodward et al., 2023).

5. Mathematical Foundations, Dualities, and Exactness

The NMZ decomposition is deeply connected to semigroup theory and operator algebras:

The existence and uniqueness of the GLE and the identification of the memory kernel and the fluctuating force follow from the solvability of associated Volterra equations. This establishes the GLE and second fluctuation–dissipation theorem (2FDT) as rigorous consequences of operator semigroup theory, independent of Dyson identity or orthogonal-dynamics heuristics (Widder et al., 26 Mar 2025).
The fluctuating force is characterized as the unique mild solution of an abstract Cauchy problem generated by the closure of $\mathcal{Q}L\mathcal{Q}$ , and in Hamiltonian settings, is stationary due to the unitarity of the group evolution (Widder et al., 26 Mar 2025).
Exact duality is established between the Heisenberg-picture NMZ (for observables) and the Schrödinger-picture equation (for projected states), with the associated conditional expectation property of the projection necessary for state preservation (Dominy et al., 2016).

For time-dependent systems, extensions incorporate time-dependent projectors and Liouvillians, right-time-ordered exponentials, and generalized correlators, leading to evolution equations for observables and states with two-time memory kernels $K(t,s)$ and explicit time-dependence (Vrugt et al., 2019).

6. Limitations, Approximations, and Error Control

Major modeling and analytic challenges include:

Intractability of orthogonal dynamics and kernel evaluation for nonlinear/high-dimensional unresolved subsystems.
Necessity and validity of finite-memory truncation, and error bounds for memory approximations.
Preservation of physical structure: complete positivity in quantum open systems, state conservation, and equilibration.
Parameter estimation in surrogate models (e.g., memory lengths, kernel decay rates), where data-driven or physically motivated heuristics (spectral radii, stability requirements) are used (Parish et al., 2016, Stinis, 2012, Zhu et al., 2017).
Trade-offs between perfect reproduction of past dynamics (memory retention) and efficient, Markovianized system closure—especially in the presence of strong coupling or in regimes lacking clear scale separation.

Despite these challenges, the NMZ framework provides a mathematically exact, highly systematic, and physically interpretable lens for model reduction, non-Markovian effects, and memory-driven uncertainty quantification across a range of mathematical physics, computational science, and emerging machine learning domains.