Residual-Conditioned Optimal Transport: Towards Structure-Preserving Unpaired and Paired Image Restoration (2405.02843v2)
Abstract: Deep learning-based image restoration methods generally struggle with faithfully preserving the structures of the original image. In this work, we propose a novel Residual-Conditioned Optimal Transport (RCOT) approach, which models image restoration as an optimal transport (OT) problem for both unpaired and paired settings, introducing the transport residual as a unique degradation-specific cue for both the transport cost and the transport map. Specifically, we first formalize a Fourier residual-guided OT objective by incorporating the degradation-specific information of the residual into the transport cost. We further design the transport map as a two-pass RCOT map that comprises a base model and a refinement process, in which the transport residual is computed by the base model in the first pass and then encoded as a degradation-specific embedding to condition the second-pass restoration. By duality, the RCOT problem is transformed into a minimax optimization problem, which can be solved by adversarially training neural networks. Extensive experiments on multiple restoration tasks show that RCOT achieves competitive performance in terms of both distortion measures and perceptual quality, restoring images with more faithful structures as compared with state-of-the-art methods.
- Ntire 2017 challenge on single image super-resolution: Dataset and study. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 126–135, 2017.
- O-haze: a dehazing benchmark with real hazy and haze-free outdoor images. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 754–762, 2018.
- Contour detection and hierarchical image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 33(5):898–916, 2010.
- Ambientgan: Generative models from lossy measurements. In International Conference on Machine Learning (ICML), 2018.
- Learning a sparse transformer network for effective image deraining. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5896–5905, 2023.
- Restoration based generative models. In International Conference on Machine Learning (ICML), pp. 5787–5816, 2023.
- Rocgan: Robust conditional gan. International Journal of Computer Vision (IJCV), 128:2665–2683, 2020.
- Nafssr: Stereo image super-resolution using nafnet. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 1239–1248, June 2022.
- Diffusion posterior sampling for general noisy inverse problems. In International Conference on Learning Representations (ICLR), 2023.
- Irnext: Rethinking convolutional network design for image restoration. In International Conference on Machine Learning (ICML), 2023a.
- Selective frequency network for image restoration. In The Eleventh International Conference on Learning Representations (ICLR), 2023b.
- A general decoupled learning framework for parameterized image operators. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 43(1):33–47, 2019.
- Franzen, R. Kodak lossless true color image suite. source: http://r0k. us/graphics/kodak, 4(2):9, 1999.
- Removing rain from single images via a deep detail network. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3855–3863, 2017.
- Implicit diffusion models for continuous super-resolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 10021–10030, 2023.
- Optimal transport-guided conditional score-based diffusion model. In Advances in Neural Information Processing Systems (NeurIPS), 2023.
- Single image haze removal using dark channel prior. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 33(12):2341–2353, 2010.
- Gans trained by a two time-scale update rule converge to a local nash equilibrium. Advances in Neural Information Processing Systems (NeurIPS), 30, 2017.
- Kantorovich, L. V. On the translocation of masses. In Dokl. Akad. Nauk. USSR (NS), volume 37, pp. 199–201, 1942.
- Snips: Solving noisy inverse problems stochastically. Advances in Neural Information Processing Systems (NeurIPS), 34:21757–21769, 2021.
- Denoising diffusion restoration models. In Advances in Neural Information Processing Systems (NeurIPS), 2022.
- Kernel neural optimal transport. In International Conference on Learning Representations (ICLR), 2023a. URL https://openreview.net/forum?id=Zuc_MHtUma4.
- Neural optimal transport. In International Conference on Learning Representations (ICLR), 2023b. URL https://openreview.net/forum?id=d8CBRlWNkqH.
- Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp. 4681–4690, 2017.
- Benchmarking single-image dehazing and beyond. IEEE Transactions on Image Processing (TIP), 28(1):492–505, 2018a.
- Single image dehazing via conditional generative adversarial network. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 8202–8211, 2018b.
- Rain streak removal using layer priors. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2736–2744, 2016.
- Swinir: Image restoration using swin transformer. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp. 1833–1844, 2021.
- Image restoration with mean-reverting stochastic differential equations. International Conference on Machine Learning (ICML), 2023.
- Waterloo exploration database: New challenges for image quality assessment models. IEEE Transactions on Image Processing (TIP), 26(2):1004–1016, 2016.
- A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), volume 2, pp. 416–423, 2001.
- Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784, 2014.
- Monge, G. Mémoire sur la théorie des déblais et des remblais. Mem. Math. Phys. Acad. Royale Sci., pp. 666–704, 1781.
- GibbsDDRM: A partially collapsed gibbs sampler for solving blind inverse problems with denoising diffusion restoration. In International Conference on Machine Learning (ICML), 2023.
- Physics-based generative adversarial models for image restoration and beyond. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 43(7):2449–2462, 2020.
- Exploiting deep generative prior for versatile image restoration and manipulation. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 44(11):7474–7489, 2021.
- Promptir: Prompting for all-in-one blind image restoration. Advances in Neural Information Processing Systems (NeurIPS), 2023.
- Single image dehazing via multi-scale convolutional neural networks. In Proceedings of the European Conference on Computer Vision (ECCV), pp. 154–169, 2016.
- Rockafellar, R. Integral functionals, normal integrands and measurable selections. Nonlinear Operators and the Calculus of Variations, pp. 157–207, 1976.
- Palette: Image-to-image diffusion models. In ACM SIGGRAPH 2022 Conference Proceedings, pp. 1–10, 2022a.
- Image super-resolution via iterative refinement. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 45(4):4713–4726, 2022b.
- Vision transformers for single image dehazing. IEEE Transactions on Image Processing (TIP), 32:1927–1941, 2023.
- Image super-resolution using gradient profile prior. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–8. IEEE, 2008.
- Uncertainty-aware unsupervised image deblurring with deep priors guided by domain knowledge. arXiv e-prints, pp. arXiv–2210, 2022.
- Uncertainty-aware unsupervised image deblurring with deep residual prior. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9883–9892, 2023.
- Villani, C. et al. Optimal transport: old and new, volume 338. Springer, 2009.
- Promptrestorer: A prompting image restoration method with degradation perception. In Advances in Neural Information Processing Systems (NeurIPS), 2023.
- Spatial attentive single-image deraining with a high quality real rain dataset. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2019.
- Optimal transport for unsupervised denoising learning. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 45(2):2104–2118, 2022a.
- Esrgan: Enhanced super-resolution generative adversarial networks. In Proceedings of the European conference on computer vision (ECCV) workshops, pp. 0–0, 2018.
- Uformer: A general u-shaped transformer for image restoration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 17683–17693, 2022b.
- Deraincyclegan: Rain attentive cyclegan for single image deraining and rainmaking. IEEE Transactions on Image Processing, 30:4788–4801, 2021.
- Deep joint rain detection and removal from a single image. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1357–1366, 2017.
- Local implicit normalizing flow for arbitrary-scale image super-resolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
- Multi-stage progressive image restoration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
- Restormer: Efficient transformer for high-resolution image restoration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
- Density-aware single image de-raining using a multi-stream dense network. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 695–704, 2018.
- Image de-raining using a conditional generative adversarial network. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 30(11):3943–3956, 2019a.
- The unreasonable effectiveness of deep features as a perceptual metric. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 586–595, 2018.
- Ranksrgan: Generative adversarial networks with ranker for image super-resolution. In Proceedings of the IEEE/CVF international conference on computer vision, pp. 3096–3105, 2019b.
- Real-time controllable denoising for image and video. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 14028–14038, June 2023.
- Large scale image completion via co-modulated generative adversarial networks. In International Conference on Learning Representations (ICLR), 2020.
- Unsupervised deep video denoising with untrained network. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), volume 37, pp. 3651–3659, 2023.
- Fourmer: an efficient global modeling paradigm for image restoration. In International Conference on Machine Learning (ICML), 2023.
- Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp. 2223–2232, 2017.
- Denoising diffusion models for plug-and-play image restoration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1219–1229, 2023.
Collections
Sign up for free to add this paper to one or more collections.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.