Collapse by Conditioning: Training Class-conditional GANs with Limited Data (2201.06578v2)

Published 17 Jan 2022 in cs.CV and cs.AI

Abstract: Class-conditioning offers a direct means to control a Generative Adversarial Network (GAN) based on a discrete input variable. While necessary in many applications, the additional information provided by the class labels could even be expected to benefit the training of the GAN itself. On the contrary, we observe that class-conditioning causes mode collapse in limited data settings, where unconditional learning leads to satisfactory generative ability. Motivated by this observation, we propose a training strategy for class-conditional GANs (cGANs) that effectively prevents the observed mode-collapse by leveraging unconditional learning. Our training strategy starts with an unconditional GAN and gradually injects the class conditioning into the generator and the objective function. The proposed method for training cGANs with limited data results not only in stable training but also in generating high-quality images, thanks to the early-stage exploitation of the shared information across classes. We analyze the observed mode collapse problem in comprehensive experiments on four datasets. Our approach demonstrates outstanding results compared with state-of-the-art methods and established baselines. The code is available at https://github.com/mshahbazi72/transitional-cGAN

Authors (4)
  1. Mohamad Shahbazi (9 papers)
  2. Martin Danelljan (96 papers)
  3. Danda Pani Paudel (95 papers)
  4. Luc Van Gool (570 papers)
Citations (31)

Summary

Class-Conditional GANs with Limited Data: Mitigating Mode Collapse via a Hybrid Training Strategy

The paper "Collapse by Conditioning: Training Class-conditional GANs with Limited Data" addresses a critical issue in the training of class-conditional Generative Adversarial Networks (cGANs) when data is scarce: mode collapse. This phenomenon, which compromises the diversity of generated samples, is particularly aggravated by the incorporation of class-conditioning in environments with limited data. The authors identify this problem through empirical observations and propose an innovative training strategy that transitions from unconditional to conditional learning, enhancing model stability and sample diversity.

Research Context and Problem Identification

Among generative models, GANs have garnered significant attention for their ability to produce realistic synthetic data. Conditional GANs extend this capability by incorporating class labels to control the output class, which is vital in applications like image synthesis and manipulation. However, training cGANs typically requires large amounts of labeled data, a requirement that is often infeasible due to the cost of data collection and privacy concerns. Prior research has largely overlooked the adverse effects of conditioning in low-data regimes, often presuming that class labels should guide the generator effectively even when data is sparse.

Through extensive experimentation, the authors reveal a counter-intuitive phenomenon: class-conditioned GANs underperform their unconditional counterparts in limited-data scenarios, suffering from pronounced mode collapse. Indeed, visual outputs illustrate a stark lack of intra-class diversity with severe artifacts, despite the presence of conditioning labels that intuitively should aid learning.

Proposed Methodology

The paper introduces a gradual, hybrid training strategy that leverages the strengths of unconditional GANs to stabilize the early stages of training. This is achieved through a controlled transition that integrates class conditioning as training progresses. The generator architecture is adapted minimally to incorporate class information progressively: by additionally conditioning the generator on a transition parameter, the model shifts gradually from unconditional to class-conditional generation, with the influence of the class labels increasing over the training iterations, as sketched below.
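
To make the transition concrete, here is a minimal PyTorch sketch of the idea, not the authors' implementation: a linear schedule `transition_weight` (with illustrative hyper-parameter names `t_start` and `t_end`) gates a class embedding that is added to the latent code, so the generator behaves unconditionally at the start of training and becomes fully class-conditional by the end.

```python
import torch
import torch.nn as nn

def transition_weight(t: int, t_start: int, t_end: int) -> float:
    """Linear transition schedule: 0 before t_start, 1 after t_end.
    t_start and t_end are illustrative hyper-parameters, not the paper's names."""
    if t <= t_start:
        return 0.0
    if t >= t_end:
        return 1.0
    return (t - t_start) / (t_end - t_start)

class TransitionalGenerator(nn.Module):
    """Toy generator whose class conditioning is gated by the transition weight."""
    def __init__(self, z_dim: int, n_classes: int, out_dim: int):
        super().__init__()
        self.embed = nn.Embedding(n_classes, z_dim)  # learned class embedding
        self.net = nn.Sequential(
            nn.Linear(z_dim, 256), nn.ReLU(),
            nn.Linear(256, out_dim),
        )

    def forward(self, z: torch.Tensor, y: torch.Tensor, lam: float) -> torch.Tensor:
        # lam = 0.0 reproduces an unconditional generator;
        # lam = 1.0 injects the class embedding at full strength.
        cond = lam * self.embed(y)
        return self.net(z + cond)
```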

The method requires only minimal architectural changes, making it applicable across different GAN frameworks. Moreover, the authors propose a revised training objective that adds the conditional loss terms in a weighted manner, controlled by the same transition parameter, ensuring stability throughout the learning process (see the sketch after this paragraph).
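
One plausible reading of this weighted objective, under the standard non-saturating logistic GAN loss, is shown below; the `(1 - lam)` / `lam` blend of unconditional and conditional terms is an assumption for illustration, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def discriminator_loss(d_uncond_real: torch.Tensor, d_uncond_fake: torch.Tensor,
                       d_cond_real: torch.Tensor, d_cond_fake: torch.Tensor,
                       lam: float) -> torch.Tensor:
    """Blend unconditional and conditional non-saturating logistic GAN losses.
    The (1 - lam) / lam weighting is an illustrative reading of the paper's
    transition-controlled objective, not its exact formulation."""
    uncond = F.softplus(-d_uncond_real).mean() + F.softplus(d_uncond_fake).mean()
    cond = F.softplus(-d_cond_real).mean() + F.softplus(d_cond_fake).mean()
    return (1.0 - lam) * uncond + lam * cond
```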

Experimental Evaluation

Empirical evaluations are performed on four datasets under constrained data conditions. The authors use the Fréchet Inception Distance (FID) and Kernel Inception Distance (KID) to quantify generative quality and diversity, demonstrating that their approach surpasses both state-of-the-art models and established baselines. The results underscore the robustness of the proposed transitional approach, which increases diversity and fidelity while maintaining stable training dynamics even in challenging data-constrained settings.
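
For reference, FID compares the mean and covariance of feature embeddings (typically Inception activations) of real and generated images under a Gaussian assumption. The sketch below is the standard computation and is independent of this paper's code.

```python
import numpy as np
from scipy.linalg import sqrtm

def fid(feats_real: np.ndarray, feats_fake: np.ndarray) -> float:
    """FID between two feature sets (rows = samples, e.g. Inception activations).
    Standard definition: ||mu_r - mu_f||^2 + Tr(C_r + C_f - 2 (C_r C_f)^(1/2))."""
    mu_r, mu_f = feats_real.mean(axis=0), feats_fake.mean(axis=0)
    cov_r = np.cov(feats_real, rowvar=False)
    cov_f = np.cov(feats_fake, rowvar=False)
    covmean = sqrtm(cov_r @ cov_f)
    if np.iscomplexobj(covmean):  # numerical noise can introduce imaginary parts
        covmean = covmean.real
    return float(((mu_r - mu_f) ** 2).sum() + np.trace(cov_r + cov_f - 2.0 * covmean))
```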

Importantly, the paper examines different hyper-parameter settings concerning the transition period and frequency, concluding that the method is largely insensitive to specific settings, thereby reducing the necessity for extensive hyper-parameter tuning. This adaptability further underscores the practicality of the proposed solution.

Implications and Future Directions

The implications of this research are noteworthy as it resolves a previously unexamined and significant issue in the cGAN domain, paving the way for more robust training methodologies in low-sample environments. This advancement will have practical benefits in fields requiring high-fidelity synthetic data from limited, labeled inputs, such as medical imaging, autonomous driving simulations, and privacy-sensitive data generation domains.

Future work could explore extensions of the proposed method to other forms of conditioning, such as semantic maps or attribute-based conditioning. Insights from this research may also stimulate further investigation into the adaptability and robustness of adversarial frameworks, potentially leading to innovation across a variety of generative tasks. Additionally, applying a similar unconditional-to-conditional transition to other generative architectures, such as VAEs or diffusion models, could broaden the reach of generative modeling in resource-constrained environments.
