Nonequilbrium physics of generative diffusion models (2405.11932v3)
Abstract: Generative diffusion models apply the concept of Langevin dynamics in physics to machine leaning, attracting a lot of interests from engineering, statistics and physics, but a complete picture about inherent mechanisms is still lacking. In this paper, we provide a transparent physics analysis of diffusion models, formulating the fluctuation theorem, entropy production, equilibrium measure, and Franz-Parisi potential to understand the dynamic process and intrinsic phase transitions. Our analysis is rooted in a path integral representation of both forward and backward dynamics, and in treating the reverse diffusion generative process as a statistical inference, where the time-dependent state variables serve as quenched disorder akin to that in spin glass theory. Our study thus links stochastic thermodynamics, statistical inference and geometry based analysis together to yield a coherent picture about how the generative diffusion models work.
- Deep Learning. MIT Press, Cambridge, MA, 2016.
- Haiping Huang. Statistical Mechanics of Neural Networks. Springer, Singapore, 2022.
- Deep unsupervised learning using nonequilibrium thermodynamics. In International conference on machine learning, pages 2256–2265. PMLR, 2015.
- Denoising diffusion probabilistic models. Advances in neural information processing systems, 33:6840–6851, 2020.
- Generative modeling by estimating gradients of the data distribution. Advances in neural information processing systems, 32, 2019.
- Score-based generative modeling through stochastic differential equations. arXiv:2011.13456, 2020.
- Hannes Risken. The Fokker-Planck Equation. Springer, Berlin, 1996.
- N.G. Van Kampen. Stochastic Processes in Physics and Chemistry. 3rd ed., North-Holland Personal Library, North-Holland, Amsterdam, 2007.
- Udo Seifert. Stochastic thermodynamics, fluctuation theorems and molecular machines. Reports on Progress in Physics, 75:126001, 2012.
- Introduction to dynamical mean-field theory of randomly connected neural networks with bidirectionally correlated couplings. SciPost Phys. Lect. Notes, page 79, 2024.
- Generative diffusion in very large dimensions. Journal of Statistical Mechanics: Theory and Experiment, 2023(9):093402, oct 2023.
- Spontaneous symmetry breaking in generative diffusion models. In A. Oh, T. Neumann, A. Globerson, K. Saenko, M. Hardt, and S. Levine, editors, Advances in Neural Information Processing Systems, volume 36, pages 66377–66389. Curran Associates, Inc., 2023.
- Dynamical regimes of diffusion models. arXiv:2402.18491, 2024.
- Sampling with flows, diffusion and autoregressive neural networks: A spin-glass perspective. arXiv:2308.14085, 2023.
- Luca Ambrogioni. The statistical thermodynamics of generative diffusion models: Phase transitions, symmetry breaking and critical instability. arXiv:2310.17467, 2023.
- Understanding diffusion models by feynman’s path integral. arXiv:2403.11262, 2024.
- Minimal model of permutation symmetry in unsupervised learning. Journal of Physics A: Mathematical and Theoretical, 52:414001, 2019.
- Statistical physics of unsupervised learning with prior knowledge in neural networks. Phys. Rev. Lett., 124:248302, 2020.
- Stochastic thermodynamics: an introduction. Princeton University Press, 2021.
- Albert Einstein. The theory of the brownian movement. Ann der Physik, 17:549, 1905.
- Learning mixtures of gaussians using the ddpm objective. arXiv:2307.01178, 2023.
- Rules of calculus in the path integral representation of white noise langevin equations: the onsager–machlup approach. Journal of Physics A: Mathematical and Theoretical, 50(34):345001, 2017.
- Udo Seifert. Entropy production along a stochastic trajectory and an integral fluctuation theorem. Phys. Rev. Lett., 95:040602, Jul 2005.
- Tânia Tomé. Entropy production in nonequilibrium systems described by a fokker-planck equation. Brazilian Journal of Physics, 36:1285–1289, 2006.
- Brian DO Anderson. Reverse-time diffusion equation models. Stochastic Processes and their Applications, 12(3):313–326, 1982.
- Pascal Vincent. A connection between score matching and denoising autoencoders. Neural Computation, 23(7):1661–1674, 2011.
- Effective potential in glassy systems: theory and simulations. Physica A: Statistical Mechanics and its Applications, 261(3):317–339, 1998.
- Origin of the computational hardness for learning with binary synapses. Phys. Rev. E, 90:052813, 2014.
- Spin Glass Theory and Beyond. World Scientific, Singapore, 1987.
- Video generation models as world simulators. 2024.