
Self-Supervised GANs via Auxiliary Rotation Loss (1811.11212v2)

Published 27 Nov 2018 in cs.LG, cs.CV, and stat.ML

Abstract: Conditional GANs are at the forefront of natural image synthesis. The main drawback of such models is the necessity for labeled data. In this work we exploit two popular unsupervised learning techniques, adversarial training and self-supervision, and take a step towards bridging the gap between conditional and unconditional GANs. In particular, we allow the networks to collaborate on the task of representation learning, while being adversarial with respect to the classic GAN game. The role of self-supervision is to encourage the discriminator to learn meaningful feature representations which are not forgotten during training. We test empirically both the quality of the learned image representations, and the quality of the synthesized images. Under the same conditions, the self-supervised GAN attains a similar performance to state-of-the-art conditional counterparts. Finally, we show that this approach to fully unsupervised learning can be scaled to attain an FID of 23.4 on unconditional ImageNet generation.

Authors (5)
  1. Ting Chen (148 papers)
  2. Xiaohua Zhai (51 papers)
  3. Marvin Ritter (10 papers)
  4. Neil Houlsby (62 papers)
  5. Mario Lucic (42 papers)
Citations (297)

Summary

Overview of "Self-Supervised GANs via Auxiliary Rotation Loss"

The paper "Self-Supervised GANs via Auxiliary Rotation Loss" addresses a significant challenge within Generative Adversarial Networks (GANs): the dependency on labeled data for effective training in complex generative tasks. The authors propose a novel self-supervised learning approach to improve GAN training by integrating an auxiliary rotation-based self-supervision task into the training process, thereby creating what they term a self-supervised GAN (SS-GAN).

Methodological Contributions

The central contribution of the paper is a self-supervised framework that uses rotation prediction as a surrogate task for the discriminator, mitigating catastrophic forgetting and instability during adversarial training. The method randomly rotates real and generated images by 0, 90, 180, or 270 degrees and trains the discriminator to predict the rotation angle. The core idea is that this auxiliary task anchors useful data representations in the discriminator, making them less prone to being forgotten under the non-stationary data distribution induced by GAN training.
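
As a concrete illustration, the rotation task can be implemented in a few lines of PyTorch. This is a minimal sketch under stated assumptions, not the authors' code: `rotate_batch` and `rotation_loss` are hypothetical helper names, and the discriminator is assumed to expose an auxiliary four-way rotation head alongside its real/fake output.

```python
import torch
import torch.nn.functional as F

def rotate_batch(x):
    """Make four rotated copies of an image batch (0/90/180/270 degrees)
    and the matching rotation labels.

    Hypothetical helper for illustration; x has shape (N, C, H, W)."""
    rotated = torch.cat([torch.rot90(x, k, dims=[2, 3]) for k in range(4)], dim=0)
    labels = torch.arange(4, device=x.device).repeat_interleave(x.size(0))
    return rotated, labels

def rotation_loss(rotation_logits, labels):
    """Cross-entropy over the four rotation classes, predicted by an
    auxiliary head of the discriminator (assumed, not shown here)."""
    return F.cross_entropy(rotation_logits, labels)
```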

The training procedure combines the conventional adversarial objective with the self-supervised rotation objective. The two networks remain adversarial on the standard real-versus-fake classification, but collaborate on the rotation task: the generator is encouraged to produce images whose rotations the discriminator can recognize, while the discriminator learns rotation-relevant features from real images.
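
Written out, the combined objective takes roughly the following form. This is our reconstruction of the SS-GAN formulation, which is not reproduced in this summary: here V(G, D) is the standard GAN value function, R = {0°, 90°, 180°, 270°}, x^r denotes image x rotated by r, Q_D is the discriminator's predictive distribution over rotation classes, and α, β are weighting hyperparameters.

```latex
% Reconstruction of the SS-GAN objectives (not verbatim from the paper):
\begin{align*}
L_G &= -V(G, D) - \alpha\,\mathbb{E}_{x \sim P_G}\,\mathbb{E}_{r \sim \mathcal{R}}
       \big[\log Q_D(R = r \mid x^r)\big] \\
L_D &= \phantom{-}V(G, D) - \beta\,\mathbb{E}_{x \sim P_{\mathrm{data}}}\,
       \mathbb{E}_{r \sim \mathcal{R}}\big[\log Q_D(R = r \mid x^r)\big]
\end{align*}
```

Note the asymmetry: the discriminator's rotation term is computed on real images only, while the generator's term is computed on generated images, which is what makes the rotation task collaborative rather than adversarial.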

Empirical Evaluation

The experimental analysis is robust, covering multiple datasets including CIFAR-10, ImageNet, LSUN-Bedroom, and CelebA-HQ. The experiments reveal that SS-GANs achieve performance on par with label-conditioned GANs (Cond-GANs) when benchmarked with the Fréchet Inception Distance (FID). Notably, the SS-GAN attains an FID of 23.4 on the highly challenging task of unconditional ImageNet generation, without requiring labeled data. This underscores the efficacy of self-supervision even in complex, large-scale data environments.
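
For context, FID fits a Gaussian to Inception-network activations of real and of generated images and measures the Fréchet distance between the two; lower is better. A minimal sketch of the computation from precomputed activation statistics (the helper name `fid` is ours, not from the paper):

```python
import numpy as np
from scipy import linalg

def fid(mu_r, sigma_r, mu_g, sigma_g):
    """Frechet Inception Distance between N(mu_r, sigma_r) and N(mu_g, sigma_g):
    ||mu_r - mu_g||^2 + Tr(sigma_r + sigma_g - 2 (sigma_r sigma_g)^{1/2})."""
    covmean, _ = linalg.sqrtm(sigma_r @ sigma_g, disp=False)
    diff = mu_r - mu_g
    # .real discards small imaginary parts arising from numerical error
    return float(diff @ diff + np.trace(sigma_r + sigma_g - 2.0 * covmean.real))
```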

The paper provides a comprehensive analysis of the impact of self-supervision on the stability and robustness of GAN training. A notable finding is that incorporating the self-supervised loss not only stabilizes training but also improves the quality of generated samples, narrowing the performance gap between conditional and unconditional GANs.

Theoretical and Practical Implications

By freeing GANs from the dependence on labeled data, this research expands the potential applicability of GANs, particularly in scenarios where labeling can be costly or infeasible. The integration of self-supervised objectives could facilitate broader adoption in domains requiring complex generative tasks, which were traditionally reliant on extensive labeled datasets.

Furthermore, the demonstrated robustness across hyperparameter variations suggests that SS-GANs could offer a more dependable pathway to training GANs in a variety of environments without exhaustive hyperparameter tuning.

Future Directions

This work opens potential research avenues in multiple directions. Future explorations could involve deploying more advanced self-supervision tasks or combining multiple self-supervised signals to further enhance the representation learning within GANs. Additionally, extending this framework to semi-supervised settings or integrating it with cutting-edge GAN architectural innovations could further bolster the generative performance of GANs in unsupervised setups.

In summary, the paper "Self-Supervised GANs via Auxiliary Rotation Loss" marks a significant stride toward unsupervised natural image synthesis through an approach that couples adversarial and self-supervised learning. This work is a pivotal step in advancing GAN capabilities, particularly on complex datasets, without reliance on labels.
