Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
131 tokens/sec
GPT-4o
10 tokens/sec
Gemini 2.5 Pro Pro
47 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

JKO schemes with general transport costs (2402.17681v1)

Published 27 Feb 2024 in math.AP and math.PR

Abstract: We modify the JKO scheme, which is a time discretization of Wasserstein gradient flows, by replacing the Wasserstein distance with more general transport costs on manifolds. We show when the cost function has a mixed Hessian which defines a Riemannian metric, our modified JKO scheme converges under suitable conditions to the corresponding Riemannian Fokker--Planck equation. Thus on a Riemannian manifold one may replace the (squared) Riemannian distance with any cost function which induces the metric. Of interest is when the Riemannian distance is computationally intractable, but a suitable cost has a simple analytic expression. We consider the Fokker--Planck equation on compact submanifolds with the Neumann boundary condition and on complete Riemannian manifolds with a finite drift condition. As an application we consider Hessian manifolds, taking as a cost the Bregman divergence.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (57)
  1. From a large-deviations principle to the Wasserstein gradient flow: a new micro-macro passage. Communications in Mathematical Physics, 307:791–815, 2011.
  2. Martial Agueh. Existence of solutions to degenerate parabolic equations via the Monge-Kantorovich theory. Advances in Differential Equations, 10(3):309–360, 2005.
  3. Hessian Riemannian gradient flows in convex programming. SIAM Journal on Control and Optimization, 43(2):477–501, 2004.
  4. Shun-Ichi Amari. Information Geometry and Its Applications. Springer, 2016.
  5. Gradient Flows in Metric Spaces and in the Space of Probability Measures. Lectures in Mathematics ETH Zürich. Birkhäuser Verlag, Basel, second edition, 2008.
  6. Input convex neural networks. In International Conference on Machine Learning, pages 146–155. PMLR, 2017.
  7. Clustering with Bregman divergences. Journal of Machine Learning Research, 6(Oct):1705–1749, 2005.
  8. Ole Barndorff-Nielsen. Information and Exponential Families in Statistical Theory. John Wiley & Sons, 2014.
  9. Michéle Basseville. Divergence measures for statistical data processing—an annotated bibliography. Signal Processing, 93(4):621–633, 2013.
  10. Mirror descent and nonlinear projected subgradient methods for convex optimization. Operations Research Letters, 31(3):167–175, 2003.
  11. A geometric approach to apriori estimates for optimal transport maps. arXiv preprint arXiv:2311.10208, 2023.
  12. Yann Brenier. Polar factorization and monotone rearrangement of vector-valued functions. Communications on Pure and Applied Mathematics, 44(4):375–417, 1991.
  13. Proximal optimal transport modeling of population dynamics. In International Conference on Artificial Intelligence and Statistics, pages 6511–6528. PMLR, 2022.
  14. Geometric Modeling in Probability and Statistics. Springer, 2014.
  15. An explicit solution of information geodesic equations for the multivariate normal model. Statistics & Risk Modeling, 9(1-2):119–138, 1991.
  16. On Monge’s problem for Bregman-like cost functions. Journal of Convex Analysis, 14(3):647–655, 2007.
  17. Wasserstein mirror gradient flow as the limit of the Sinkhorn algorithm, 2023. arXiv 2307.16421.
  18. Matthias Erbar. The heat equation on manifolds as a gradient flow in the Wasserstein space. Ann. Inst. Henri Poincaré Probab. Stat., 46(1):1–23, 2010.
  19. A variational method for a class of parabolic PDEs. Ann. Sc. Norm. Super. Pisa Cl. Sci. (5), 10(1):207–252, 2011.
  20. A new transportation distance between non-negative measures, with applications to gradients flows with Dirichlet boundary conditions. J. Math. Pures Appl. (9), 94(2):107–130, 2010.
  21. An Invitation to Optimal Transport, Wasserstein distances, and Gradient Flows. EMS Press, 2021.
  22. Avner Friedman. Partial Differential Equations of Parabolic Type. Prentice-Hall, Inc., Englewood Cliffs, NJ, 1964.
  23. Hopfield neural network flow: A geometric viewpoint. IEEE Transactions on Neural Networks and Learning Systems, 31(11):4869–4880, 2020.
  24. The variational formulation of the fokker–planck equation. SIAM Journal on Mathematical Analysis, 29(1):1–17, 1998.
  25. Olav Kallenberg. Foundations of Modern Probability, volume 1. Springer, third edition.
  26. Olav Kallenberg. Foundations of Modern Probability, volume 2. Springer, third edition.
  27. Continuity, curvature, and the general covariance of optimal transportation. Journal of the European Mathematical Society, 12(4):1009–1040, 2010.
  28. John M Lee. Introduction to Riemannian Manifolds. Springer, second edition, 2018.
  29. Gradient descent with a general cost, 2023. arXiv 2305.04917.
  30. Regularity of potential functions of the optimal transportation problem. Archive for Rational Mechanics and Analysis, 177(2):151–183, 2005.
  31. Optimal transport mapping via input convex neural networks. In International Conference on Machine Learning, pages 6672–6681. PMLR, 2020.
  32. Takao Matumoto. Any statistical manifold has a contrast function – on the C3superscript𝐶3{C}^{3}italic_C start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT-function taking the minimum at the diagonal of the product manifold. Hiroshima Mathematical Journal, 23(2):327–332, 1993.
  33. Robert J McCann. A convexity principle for interacting gases. Advances in Mathematics, 128(1):153–179, 1997.
  34. Robert John McCann. A convexity theory for interacting gases and equilibrium crystals. ProQuest LLC, Ann Arbor, MI, 1994. Thesis (Ph.D.)–Princeton University.
  35. Large-scale Wasserstein gradient flows. Advances in Neural Information Processing Systems, 34:15243–15256, 2021.
  36. Contraction of general transportation costs along solutions to Fokker-Planck equations with monotone drifts. J. Math. Pures Appl. (9), 95(1):18–35, 2011.
  37. Barrett O’Neill. Semi-Riemannian Geometry with Applications to Relativity. Academic Press, 1983.
  38. Felix Otto. Dynamics of labyrinthine pattern formation in magnetic fluids: a mean-field theory. Arch. Rational Mech. Anal., 141(1):63–103, 1998.
  39. Felix Otto. The geometry of dissipative evolution equations: the porous medium equation. Comm. Partial Differential Equations, 26(1-2):101–174, 2001.
  40. Soumik Pal. On the difference between entropic cost and the optimal transport cost. The Annals of Applied Probability, 34(1B):1003–1028, 2024.
  41. Multiplicative Schröodinger problem and the Dirichlet transport. Probability Theory and Related Fields, 178(1):613–654, 2020.
  42. Gabriel Peyré. Entropic approximation of Wasserstein gradient flows. SIAM Journal on Imaging Sciences, 8(4):2323–2351, 2015.
  43. Cale Rankin. A remark on the geometric interpretation of the A3w condition from optimal transport. Bull. Aust. Math. Soc., 108(1):162–165, 2023.
  44. Bregman-Wasserstein divergence: geometry and applications. arXiv preprint arXiv:2302.05833, 2023.
  45. R Tyrrell Rockafellar. Convex Analysis. Princeton University Press, 1970.
  46. Filippo Santambrogio. Optimal Transport for Applied Mathematicians. Birkäuser, 2015.
  47. Filippo Santambrogio. {{\{{Euclidean, metric, and Wasserstein}}\}} gradient flows: an overview. Bulletin of Mathematical Sciences, 7(1):87–154, 2017.
  48. Giuseppe Savaré. Gradient flows and diffusion semigroups in metric spaces under lower curvature bounds. C. R. Math. Acad. Sci. Paris, 345(3):151–154, 2007.
  49. Hirohiko Shima. The Geometry of Hessian Structures. World Scientific, 2007.
  50. Geometry of Hessian manifolds. Differential geometry and its applications, 7(3):277–290, 1997.
  51. On strict convexity and continuous differentiability of potential functions in optimal transportation. Arch. Ration. Mech. Anal., 192(3):403–418, 2009.
  52. Cédric Villani. Topics in Optimal Transportation. American Mathematical Society, 2003.
  53. Cédric Villani. Optimal Transport: Old and New. Springer, 2008.
  54. A variational perspective on accelerated methods in optimization. Proceedings of the National Academy of Sciences, 113(47):E7351–E7358, 2016.
  55. Pseudo-Riemannian geometry encodes information geometry in optimal transport. Information Geometry, 5(1):131–159, 2022.
  56. Tsallis and Rényi deformations linked via a new λ𝜆\lambdaitalic_λ-duality. IEEE Transactions on Information Theory, 68(8):5353–5373, 2022.
  57. Xicheng Zhang. Variational approximation for Fokker-Planck equation on Riemannian manifold. Probability Theory and Related Fields, 137(3-4):519–539, 2007.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com