Unpaired Image-to-Image Translation via Neural Schrödinger Bridge (2305.15086v3)
Abstract: Diffusion models are a powerful class of generative models which simulate stochastic differential equations (SDEs) to generate data from noise. While diffusion models have achieved remarkable progress, they have limitations in unpaired image-to-image (I2I) translation tasks due to the Gaussian prior assumption. Schr\"{o}dinger Bridge (SB), which learns an SDE to translate between two arbitrary distributions, have risen as an attractive solution to this problem. Yet, to our best knowledge, none of SB models so far have been successful at unpaired translation between high-resolution images. In this work, we propose Unpaired Neural Schr\"{o}dinger Bridge (UNSB), which expresses the SB problem as a sequence of adversarial learning problems. This allows us to incorporate advanced discriminators and regularization to learn a SB between unpaired data. We show that UNSB is scalable and successfully solves various unpaired I2I translation tasks. Code: \url{https://github.com/cyclomon/UNSB}
- Mutual information neural estimation. In ICML, 2018.
- One-sided unsupervised domain mapping. In I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (eds.), Advances in Neural Information Processing Systems, volume 30. Curran Associates, Inc., 2017. URL https://proceedings.neurips.cc/paper/2017/file/59b90e1005a220e2ebc542eb9d950b1e-Paper.pdf.
- Diffusion schrödinger bridge with applications to score-based generative modeling. In NeurIPS, 2021.
- The schrödinger bridge between gaussian measures has a closed form. In AISTATS, 2023.
- Reflected schrödinger bridge: Density control with path constraints. In 2021 American Control Conference (ACC), pp. 1137–1142. IEEE, 2021.
- Reusing discriminators for encoding: Towards unsupervised image-to-image translation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 8168–8177, 2020.
- Likelihood training of schrödinger bridge using forward-backward sdes theory. In ICLR, 2022.
- Stochastic control liaisons: Richard sinkhorn meets gaspard monge on a schrodinger bridge. Siam Review, 63(2):249–313, 2021.
- Stargan v2: Diverse image synthesis for multiple domains. In CVPR, 2022.
- Diffusion posterior sampling for general noisy inverse problems. In ICLR, 2023.
- Inversion by direct iteration: An alternative to denoising diffusion for image restoration. arxiv preprint arXiv:2303.11435, 2023.
- Nice: Non-linear independent components estimation. arxiv preprint arXiv:1410.8516, 2015.
- Geometry-consistent generative adversarial networks for one-sided unsupervised domain mapping. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2019.
- Generative adversarial nets. In NeurIPS, 2014.
- Entropic neural optimal transport via diffusion processes. In NeurIPS, 2023a.
- Entropic neural optimal transport via diffusion processes. In NeurIPS, 2023b.
- Building the bridge of schrödinger: A continuous entropic optimal transport benchmark. In NeurIPS Track on Datasets and Benchmarks, 2023c.
- Prompt-to-prompt image editing with cross attention control. arXiv preprint arXiv:2208.01626, 2022.
- Gans trained by a two time-scale update rule converge to a local nash equilibrium. Advances in neural information processing systems, 30, 2017.
- Denoising diffusion probabilistic models. In NeurIPS, 2020.
- Multimodal unsupervised image-to-image translation. In Proceedings of the European Conference on Computer Vision (ECCV), September 2018.
- Image-to-image translation with conditional adversarial networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1125–1134, 2017.
- Exploring patch-wise semantic relation for contrastive learning in image-to-image translation tasks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 18260–18269, June 2022.
- Progressive growing of GANs for improved quality, stability, and variation. In ICLR, 2018.
- Auto-encoding variational bayes. In ICLR, 2014.
- Neural optimal transport. In ICLR, 2023.
- Christian Léonard. A survey of the schrödinger problem and some of its connections with optimal transport. arXiv preprint arXiv:1308.0215, 2013.
- Ar-dae: Towards unbiased neural entropy estimation. In ICML, 2020.
- Deep generalized schrödinger bridge. arXiv preprint arXiv:2209.09893, 2022.
- I2sb: Image-to-image schrödinger bridge. arxiv preprint arXiv:2302.05872, 2023.
- Sdedit: Guided image synthesis and editing with stochastic differential equations. In ICLR, 2022.
- Contrastive learning for unpaired image-to-image translation. In Andrea Vedaldi, Horst Bischof, Thomas Brox, and Jan-Michael Frahm (eds.), Computer Vision – ECCV 2020, pp. 319–345, Cham, 2020. Springer International Publishing. ISBN 978-3-030-58545-7.
- Multisample flow matching: Straightening flows with minibatch couplings. In ICML, 2023.
- Paolo Dai Pra. A stochastic control appraoch to reciprocal diffusion processes. Applied Mathematics and Optimization, 23(1):313–329, 1991.
- High-resolution image synthesis with latent diffusion models. In CVPR, 2022.
- Unbiased estimation using a class of diffusion processes. Journal of Computational Physics, 472:111643, 2023.
- Can push-forward generative models fit multimodal distributions? In NeurIPS, 2022.
- Erwin Schrödinger. Sur la théorie relativiste de l’électron et l’interprétation de la mécanique quantique. Annales de l’institut Henri Poincaré, 2(4):269–310, 1932.
- Conditional simulation using diffusion schrödinger bridges. In Uncertainty in Artificial Intelligence, pp. 1792–1802. PMLR, 2022.
- Diffusion schrödinger bridge matching. In NeurIPS, 2023.
- Deep unsupervised learing using nonequilibrium thermodynamics. In ICML, 2015.
- Denoising diffusion implicit models. In ICLR, 2021a.
- Score-based generative modeling through stochastic differential equations. In ICLR, 2021b.
- Dual diffusion implicit bridges for image-to-image translation. In ICLR, 2023.
- Transport with support: Data-conditional diffusion bridges. arXiv preprint arXiv:2301.13636, 2023.
- Riemannian diffusion schrödinger bridge. arXiv preprint arXiv:2207.03024, 2022.
- Conditional flow matching: Simulation-free dynamic optimal transport. arxiv preprint arXiv:2302.00482, 2023.
- Solving schrödinger bridges via maximum likelihood. Entropy, 23(9):1134, 2021.
- Deep generative learning via schrödinger bridge. In International Conference on Machine Learning, pp. 10794–10804. PMLR, 2021a.
- Instance-wise hard negative example generation for contrastive learning in unpaired image-to-image translation. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp. 14020–14029, October 2021b.
- Tackling the generative learning trilemma with denoising diffusion gans. In ICLR, 2022.
- Path integral sampler: a stochastic control approach for sampling. arXiv preprint arXiv:2111.15141, 2021.
- Egsde: Unpaired image-to-image translation via energy-guided stochastic differential equations. In NeurIPS, 2022.
- The spatially-correlative loss for various image translation tasks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 16407–16417, June 2021.
- Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Oct 2017.