Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Taming the Tail in Class-Conditional GANs: Knowledge Sharing via Unconditional Training at Lower Resolutions (2402.17065v2)

Published 26 Feb 2024 in cs.CV, cs.AI, and cs.LG

Abstract: Despite extensive research on training generative adversarial networks (GANs) with limited training data, learning to generate images from long-tailed training distributions remains fairly unexplored. In the presence of imbalanced multi-class training data, GANs tend to favor classes with more samples, leading to the generation of low-quality and less diverse samples in tail classes. In this study, we aim to improve the training of class-conditional GANs with long-tailed data. We propose a straightforward yet effective method for knowledge sharing, allowing tail classes to borrow from the rich information from classes with more abundant training data. More concretely, we propose modifications to existing class-conditional GAN architectures to ensure that the lower-resolution layers of the generator are trained entirely unconditionally while reserving class-conditional generation for the higher-resolution layers. Experiments on several long-tail benchmarks and GAN architectures demonstrate a significant improvement over existing methods in both the diversity and fidelity of the generated images. The code is available at https://github.com/khorrams/utlo.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (56)
  1. Demystifying mmd gans. In International Conference on Learning Representations, 2018.
  2. Large scale GAN training for high fidelity natural image synthesis. In 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019. OpenReview.net, 2019.
  3. Learning imbalanced datasets with label-distribution-aware margin loss. In Advances in Neural Information Processing Systems. Curran Associates, Inc., 2019.
  4. Reslt: Residual learning for long-tailed recognition. ieee transactions on pattern analysis and machine intelligence, 2021.
  5. Class-balanced loss based on effective number of samples. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019, Long Beach, CA, USA, June 16-20, 2019, pages 9268–9277. Computer Vision Foundation / IEEE, 2019.
  6. Diffusion models beat gans on image synthesis. In Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual, pages 8780–8794, 2021.
  7. Generative adversarial nets. In Advances in Neural Information Processing Systems. Curran Associates, Inc., 2014.
  8. Long-tailed multi-label visual recognition by collaborative training on uniform and re-balanced samplings. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 15089–15098, 2021.
  9. Borderline-smote: a new over-sampling method in imbalanced data sets learning. In International conference on intelligent computing, pages 878–887. Springer, 2005.
  10. Gans trained by a two time-scale update rule converge to a local nash equilibrium. Advances in neural information processing systems, 30, 2017.
  11. The inaturalist 2019 competition dataset. https://www.kaggle.com/c/inaturalist-2019-fgvc6.
  12. Image-to-image translation with conditional adversarial networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 1125–1134, 2017.
  13. Decoupling representation and classifier for long-tailed recognition. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020. OpenReview.net, 2020.
  14. Exploring balanced feature spaces for representation learning. In International Conference on Learning Representations, 2021.
  15. MSG-GAN: multi-scale gradients for generative adversarial networks. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA, June 13-19, 2020, pages 7796–7805. Computer Vision Foundation / IEEE, 2020.
  16. Training generative adversarial networks with limited data. Advances in Neural Information Processing Systems, 33:12104–12114, 2020a.
  17. Analyzing and improving the image quality of stylegan. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA, June 13-19, 2020, pages 8107–8116. Computer Vision Foundation / IEEE, 2020b.
  18. Few-shot image generation with mixup-based distance learning. In Computer Vision - ECCV 2022 - 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part XV, pages 563–580. Springer, 2022.
  19. Learning multiple layers of features from tiny images. preprint, 2009.
  20. Ensembling off-the-shelf models for gan training. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10651–10662, 2022.
  21. Photo-realistic single image super-resolution using a generative adversarial network. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 4681–4690, 2017.
  22. Towards faster and stabilized gan training for high-fidelity few-shot image synthesis. iclr, 2021.
  23. Deep representation learning on long-tailed data: A learnable embedding augmentation perspective. Computer Vision And Pattern Recognition, 2020.
  24. Under-sampling and feature selection algorithms for s2smlp. IEEE Access, 8:191803–191814, 2020.
  25. Large-scale long-tailed recognition in an open world. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2537–2546, 2019.
  26. Are gans created equal? a large-scale study. Advances in neural information processing systems, 31, 2018.
  27. Long-tail learning via logit adjustment. In International Conference on Learning Representations, 2021.
  28. Conditional generative adversarial nets. arXiv preprint arXiv: Arxiv-1411.1784, 2014.
  29. Spectral normalization for generative adversarial networks. ICLR, 2018.
  30. Generative adversarial minority oversampling. Ieee International Conference On Computer Vision, 2019.
  31. Automated flower classification over a large number of classes. In Indian Conference on Computer Vision, Graphics and Image Processing, 2008.
  32. Conditional image synthesis with auxiliary classifier gans. In Proceedings of the 34th International Conference on Machine Learning, ICML 2017, Sydney, NSW, Australia, 6-11 August 2017, pages 2642–2651. PMLR, 2017.
  33. Few-shot image generation via cross-domain correspondence. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 10743–10752, 2021.
  34. Class balancing gan with a classifier in the loop. In Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, pages 1618–1627. PMLR, 2021.
  35. Improving gans for long-tailed data through group spectral regularization. European Conference On Computer Vision, 2022.
  36. Noisytwins: Class-consistent and diverse image generation through stylegans. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5987–5996, 2023.
  37. Balanced meta-softmax for long-tailed visual recognition. In Advances in Neural Information Processing Systems, pages 4175–4186. Curran Associates, Inc., 2020.
  38. A classification-based study of covariate shift in gan distributions. International Conference On Machine Learning, 2017.
  39. Projected gans converge faster. Advances in Neural Information Processing Systems, 34:17480–17492, 2021.
  40. Stylegan-xl: Scaling stylegan to large diverse datasets. In SIGGRAPH ’22: Special Interest Group on Computer Graphics and Interactive Techniques Conference, Vancouver, BC, Canada, August 7 - 11, 2022, pages 49:1–49:10. ACM, 2022.
  41. Collapse by conditioning: Training class-conditional GANs with limited data. In International Conference on Learning Representations, 2022.
  42. Closed-form factorization of latent semantics in gans. In CVPR, 2021.
  43. Learning hybrid image templates (hit) by information projection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(7):1354–1367, 2012.
  44. Long-tailed classification by keeping the good and removing the bad momentum causal effect. In Advances in Neural Information Processing Systems, pages 1513–1524. Curran Associates, Inc., 2020.
  45. Regularizing generative adversarial networks under limited data. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 7921–7931, 2021.
  46. Long-tailed recognition by routing diverse distribution-aware experts. In 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net, 2021.
  47. A survey on long-tailed visual recognition. International Journal of Computer Vision, 130:1837–1872, 2022.
  48. An improved under-sampling imbalanced classification algorithm. In 2021 13th International Conference on Measuring Technology and Mechatronics Automation (ICMTMA), pages 775–779. IEEE, 2021.
  49. Lsun: Construction of a large-scale image dataset using deep learning with humans in the loop. arXiv preprint arXiv:1506.03365, 2015.
  50. Stackgan: Text to photo-realistic image synthesis with stacked generative adversarial networks. In Proceedings of the IEEE international conference on computer vision, pages 5907–5915, 2017.
  51. Consistency regularization for generative adversarial networks. In International Conference on Learning Representations, 2019.
  52. The unreasonable effectiveness of deep features as a perceptual metric. Ieee/cvf Conference On Computer Vision And Pattern Recognition, 2018.
  53. Deep long-tailed learning: A survey. arXiv preprint arXiv: Arxiv-2110.04596, 2021.
  54. Learning fast sample re-weighting without reward data. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 725–734, 2021.
  55. Differentiable augmentation for data-efficient gan training. Advances in Neural Information Processing Systems, 33:7559–7570, 2020.
  56. Improving calibration for long-tailed recognition. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021, pages 16489–16498. Computer Vision Foundation / IEEE, 2021.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Saeed Khorram (10 papers)
  2. Mingqi Jiang (6 papers)
  3. Mohamad Shahbazi (9 papers)
  4. Mohamad H. Danesh (10 papers)
  5. Li Fuxin (36 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.