Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Semi-Paired Approach For Label-to-Image Translation (2306.13585v2)

Published 23 Jun 2023 in cs.CV

Abstract: Data efficiency, or the ability to generalize from a few labeled data, remains a major challenge in deep learning. Semi-supervised learning has thrived in traditional recognition tasks alleviating the need for large amounts of labeled data, yet it remains understudied in image-to-image translation (I2I) tasks. In this work, we introduce the first semi-supervised (semi-paired) framework for label-to-image translation, a challenging subtask of I2I which generates photorealistic images from semantic label maps. In the semi-paired setting, the model has access to a small set of paired data and a larger set of unpaired images and labels. Instead of using geometrical transformations as a pretext task like previous works, we leverage an input reconstruction task by exploiting the conditional discriminator on the paired data as a reverse generator. We propose a training algorithm for this shared network, and we present a rare classes sampling algorithm to focus on under-represented classes. Experiments on 3 standard benchmarks show that the proposed model outperforms state-of-the-art unsupervised and semi-supervised approaches, as well as some fully supervised approaches while using a much smaller number of paired samples.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (38)
  1. “Image-to-image translation with conditional adversarial networks” In Conference on Computer Vision and Pattern Recognition (CVPR), 2017
  2. “High-resolution image synthesis and semantic manipulation with conditional GANs” In Conference on Computer Vision and Pattern Recognition (CVPR), 2018
  3. “Generative Adversarial Nets” In NIPS, 2014
  4. L. A. Gatys, A. S. Ecker and M. Bethge “Image Style Transfer Using Convolutional Neural Networks” In Conference on Computer Vision and Pattern Recognition (CVPR), 2016
  5. “Photo-realistic single image super-resolution using a generative adversarial network” In Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 4681–4690
  6. “Semantic image synthesis with spatially-adaptive normalization” In Conference on Computer Vision and Pattern Recognition (CVPR), 2019
  7. “Rethinking Spatially-Adaptive Normalization” In arXiv:2004.02867, 2020
  8. “Learning to predict layout-to-image conditional convolutions for semantic image synthesis” In Advances in Neural Information Processing Systems (NeurIPS), 2019
  9. “SEAN: Image Synthesis With Semantic Region-Adaptive Normalization” In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, pp. 5103–5112
  10. “You Only Need Adversarial Supervision for Semantic Image Synthesis” In International Conference on Learning Representations, 2021
  11. “Semantic Palette: Guiding Scene Generation with Class Proportions” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 9342–9350
  12. “Urban-StyleGAN: Learning to Generate and Manipulate Images of Urban Scenes” In IV-Symposium, 2023
  13. “Towards Pragmatic Semantic Image Synthesis for Urban Scenes” In IV-Symposium, 2023
  14. “Unpaired image-to-image translation using cycle-consistent adversarial networks” In International Conference on Computer Vision (ICCV), 2017
  15. “Multimodal unsupervised image-to-image translation” In European Conference on Computer Vision (ECCV), 2018
  16. “Diverse Image-to-Image Translation via Disentangled Representation” In European Conference on Computer Vision (ECCV), 2018
  17. “Contrastive Learning for Unpaired Image-to-Image Translation” In European Conference on Computer Vision, 2020
  18. “Geometry-consistent generative adversarial networks for one-sided unsupervised domain mapping” In Conference on Computer Vision and Pattern Recognition (CVPR), 2019
  19. “One-sided unsupervised domain mapping” In Advances in Neural Information Processing Systems (NeurIPS), 2017
  20. Yaniv Taigman, Adam Polyak and Lior Wolf “Unsupervised Cross-Domain Image Generation” In International Conference on Learning Representations (ICLR), 2017
  21. “Learning from simulated and unsupervised images through adversarial training” In Conference on Computer Vision and Pattern Recognition (CVPR), 2017
  22. “Unsupervised pixel-level domain adaptation with generative adversarial networks” In Conference on Computer Vision and Pattern Recognition (CVPR), 2017
  23. “Travelgan: Image-to-image translation by transformation vector learning” In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019, pp. 8983–8992
  24. Rui Zhang, Tomas Pfister and Jia Li “Harmonic unpaired image-to-image translation” In International Conference on Learning Representations (ICLR), 2019
  25. “USIS: Unsupervised Semantic Image Synthesis” In Computers & Graphics, 2023 URL: https://www.sciencedirect.com/science/article/pii/S0097849323000018
  26. “Wavelet-Based Unsupervised Label-to-Image Translation” In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, pp. 1760–1764 DOI: 10.1109/ICASSP43922.2022.9746759
  27. Aamir Mustafa and Rafał K Mantiuk “Transformation consistency regularization–a semi-supervised paradigm for image-to-image translation” In European Conference on Computer Vision, 2020, pp. 599–615 Springer
  28. Jiaze Sun, Binod Bhattarai and Tae-Kyun Kim “MatchGAN: a self-supervised semi-supervised conditional generative adversarial network” In Proceedings of the Asian Conference on Computer Vision, 2020
  29. “Semi-supervised learning for few-shot image-to-image translation” In CVPR, 2020
  30. Manan Oza, Himanshu Vaghela and Sudhir Bagul “Semi-supervised image-to-image translation” In 2019 (ICAIIT)
  31. O. Ronneberger, P. Fischer and T. Brox “U-Net: Convolutional Networks for Biomedical Image Segmentation” In MICCAI, 2015
  32. “SWAGAN: A Style-based Wavelet-driven Generative Model” In ArXiv abs/2102.06108, 2021
  33. “Dualgan: Unsupervised dual learning for image-to-image translation” In International Conference on Computer Vision (ICCV), 2017
  34. “Analyzing and Improving the Image Quality of StyleGAN” In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, pp. 8107–8116
  35. “The cityscapes dataset for semantic urban scene understanding” In Conference on Computer Vision and Pattern Recognition (CVPR), 2016
  36. “Scene parsing through ade20k dataset” In Conference on Computer Vision and Pattern Recognition (CVPR), 2017
  37. Holger Caesar, Jasper Uijlings and Vittorio Ferrari “Coco-stuff: Thing and stuff classes in context” In Conference on Computer Vision and Pattern Recognition (CVPR), 2018
  38. “GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium” In Advances in Neural Information Processing Systems (NeurIPS), 2017
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. George Eskandar (15 papers)
  2. Shuai Zhang (319 papers)
  3. Mohamed Abdelsamad (6 papers)
  4. Mark Youssef (1 paper)
  5. Diandian Guo (10 papers)
  6. Bin Yang (320 papers)
Citations (1)