Lévy–Prokhorov Metric
- Lévy–Prokhorov Metric is a probability metric that quantifies the distance between Borel measures on a metric space and metrizes weak convergence.
- Its formulations via coupling, optimal transport, and test-function dualities provide versatile frameworks for analyzing convergence and stability in probability theory.
- Applications in statistical inference, machine learning, and functional data analysis highlight its role in robustly characterizing distributional shifts and geometric rigidity.
The Lévy–Prokhorov metric () is a probability metric characterizing the distance between Borel probability measures on a metric space. It metrizes weak convergence of measures, underlies the topology of weak convergence (often called Prokhorov topology), and admits several optimal transport and duality formulations. It appears equivalently as the Prokhorov or Ky Fan metric, and supports both coupling-based and test-function dualities. Its rigidity properties distill isometries of measure spaces to push-forwards by affine isometries, making it fundamental in probability theory, stochastic processes, statistical inference, functional data analysis, and machine learning.
1. Definition and Formal Structure
Let be a complete separable metric space, and the set of Borel probability measures on . For and , the –neighborhood of a Borel set is .
The Lévy–Prokhorov distance is
Key properties:
- 0 iff 1.
- 2 for all 3.
- If 4 is complete and separable, so is 5.
- 6 induces the topology of weak convergence on probability measures (Gehér et al., 2017, Pakshirajan et al., 2021, Aolaritei et al., 19 Feb 2025, Zhou et al., 2020, Kutta et al., 26 Jun 2025).
Several equivalent formulations exist:
- Coupling/transport plan: 7 where 8 is a coupling of 9 and 0 (Abraham et al., 2012, Aolaritei et al., 19 Feb 2025).
- Random variable interpretation: 1 iff there exist random variables 2 with 3, 4 bounded by 5, 6, 7 arbitrary (Aolaritei et al., 19 Feb 2025).
- Predicate lifting: For discrete distributions, 8, known also as the Ky Fan metric (Wild et al., 27 Oct 2025).
2. Metrization of Weak Convergence and Topological Properties
On a Polish space (complete, separable metric), 9 metrizes weak convergence: 0 in 1 (weakly) (Gehér et al., 2017, Pakshirajan et al., 2021, Zhou et al., 2020, Aolaritei et al., 19 Feb 2025, Kutta et al., 26 Jun 2025). The induced topology on 2 is exactly weak convergence:
- On 3, 4 (Lévy distance between cdfs), and both metrize weak convergence of distribution functions (Pakshirajan et al., 2021).
- On locally finite measures, extensions integrate Prokhorov distances over expanding balls, yielding Polish metric spaces for rooted, locally compact spaces (Abraham et al., 2012).
- Quantitative relationships: For any 5, 6 (Wasserstein distance) (Kutta et al., 26 Jun 2025).
Invariance and rigidity:
- The isometry group of 7 is isomorphic to the affine‐isometry group of 8 (Gehér et al., 2017).
- Nontrivial measure-preserving isometries arise only from affine isometries of 9; this is central for statistical and geometric characterizations in stochastic processes.
3. Optimal-Transport, Duality, and Variants
The Lévy–Prokhorov distance admits multiple dual and transport formulations:
- Wasserstein-style (coupling): 0 iff there exists a coupling 1 such that 2; equivalently, 3 where 4 is the cost for mass exceeding distance 5 (Aolaritei et al., 19 Feb 2025, Abraham et al., 2012, Wild et al., 27 Oct 2025).
- Price-function/Kantorovich lifting: LP distance can be cast as a supremum over non-expansive test functions, via a single “generally” modality 6 (Wild et al., 27 Oct 2025).
- Generalized Kantorovich–Rubinstein duality: For the LP metric, coupling-based and Kantorovich (test-function) forms coincide for all pseudometrics; this does not hold for 7-Wasserstein with 8 (Wild et al., 27 Oct 2025).
Comparison with other metrics:
- LP balls decompose as 9 capturing both local (Wasserstein-0) and global (Total Variation) perturbations (Aolaritei et al., 19 Feb 2025).
- For 1: 2 for appropriately chosen 3.
4. Isometries of Measure Spaces and Geometric Rigidity
A characterization of surjective 4–isometries for separable Banach spaces 5 holds: a bijection 6 is a 7–isometry iff there exists a surjective affine isometry 8 and 9 for all Borel 0 (Gehér et al., 2017). Key technical tools in the proof include:
- Inductive analysis of finitely-supported measures using witness functions 1.
- The introduction of 2–Lévy–Prokhorov metrics to separate atoms of measures.
- Convex-geometric techniques analyzing supports in Banach spaces.
Implications:
- The only 3–isometries are push-forwards under surjective affine isometries.
- This extends Banach–Stone–type results previously established for scalar-valued (one-dimensional) measures (Gehér et al., 2017).
- Measure spaces 4 are metrically rigid, essential for isometric classification in random element theory and probabilistic invariance principles.
5. Applications in Probability, Statistics, and Machine Learning
Probability Limit Theory and Functional Data Analysis
- The LP metric is critical in expressing quantitative central limit theorems, including rates of convergence for Gaussian approximation in both univariate and functional (infinite-dimensional) settings (Zhou et al., 2020).
- Explicit convergence rates and constants are given in both classic and sublinear expectation frameworks: For the partial sum process 5 to Brownian motion 6, 7 with explicit 8 (Zhou et al., 2020).
- In functional data analysis, LP bounds allow for strong-invariance and coupling results, essential for change-point detection and distributional approximation under weak dependence (Kutta et al., 26 Jun 2025).
Robust Conformal Prediction
- In conformal prediction, LP ambiguity sets naturally model both local (bounded) and global (outlier) distribution shifts (Aolaritei et al., 19 Feb 2025).
- Propagation through Lipschitz scoring functions yields tractable univariate LP balls, facilitating exact worst-case quantiles and coverage.
- Relation to TV and Wasserstein balls allows flexible robustness modeling.
Behavioral Metrics and Coalgebraic Distances
- In Markov process theory, the LP lifting defines 9-distance, coinciding with the maximal fixpoint of a behavioral distance functional and capturing approximate bisimulation (Desharnais et al., 14 Jul 2025).
- Unlike the Kantorovich lifting, LP lifting is locally non-expansive and enables efficient computation of behaviorally defined distances.
- Coalgebraic characterizations of 0-couplings/bisimulations arise naturally from the LP structure.
Metric Geometry and Measured Trees
- The Gromov–Hausdorff–Prokhorov metric combines LP with Hausdorff distance to metrize spaces of compact or locally compact measured metric spaces (e.g., real trees) (Abraham et al., 2012).
- The resulting space is Polish (complete, separable), with precompactness determined by bounded diameters, net sizes, and total mass.
- LP metric ensures continuous dependence of tree laws on coding functions—crucial in random geometry and continuum random tree theory.
6. Computational, Structural, and Theoretical Properties
- The set-expansion definition is computationally intractable in high dimensions, but the coupling formulation reduces to mixed-integer or flow-type programs, and the two-step W1+TV decomposition is efficient for empirical tasks (Aolaritei et al., 19 Feb 2025).
- LP balls encode local-global perturbations, with the ability to decompose distributional shifts precisely between bounded transport and TV mass movement (Aolaritei et al., 19 Feb 2025).
- The LP metric underlies stability results, e.g., for conformal predictors and in machine learning settings requiring robustness to outliers or non-local rearrangement (Wild et al., 27 Oct 2025).
- The Ky Fan metric, identical to LP on discrete supports, is key in several logic and bisimulation settings (Wild et al., 27 Oct 2025).
7. Summary Table: Comparative Features with Leading Probability Metrics
| Metric | Weak Convergence | Coupling Duality | Local+Global Shift Decomposition | Isometry Rigidity |
|---|---|---|---|---|
| Lévy–Prokhorov | Yes | Yes | Yes (W2+TV) | Yes (Banach Rx) |
| Wasserstein 3 | Yes for 4 | Yes | No | No |
| Total Variation | No | Yes | Only global | No |
| Kolmogorov | No | No | No | No |
References
- (Gehér et al., 2017) A characterisation of isometries with respect to the Lévy-Prokhorov metric
- (Pakshirajan et al., 2021) An expository note on Prohorov metric and Prohorov Theorem
- (Abraham et al., 2012) A note on Gromov-Hausdorff-Prokhorov distance between (locally) compact measure spaces
- (Zhou et al., 2020) Prokhorov distance with rates of convergence under sublinear expectations
- (Kutta et al., 26 Jun 2025) Prokhorov Metric Convergence of the Partial Sum Process for Reconstructed Functional Data
- (Aolaritei et al., 19 Feb 2025) Conformal Prediction under Levy-Prokhorov Distribution Shifts: Robustness to Local and Global Perturbations
- (Desharnais et al., 14 Jul 2025) 5-Distance via Lévy-Prokhorov Lifting
- (Wild et al., 27 Oct 2025) Generalized Kantorovich-Rubinstein Duality beyond Hausdorff and Kantorovich