Lifelong GAN: Continual Learning for Conditional Image Generation
The paper "Lifelong GAN: Continual Learning for Conditional Image Generation" addresses the significant challenge of catastrophic forgetting in deep neural networks when applied to continuous or lifelong learning paradigms. The authors propose a novel framework, Lifelong GAN, which integrates knowledge distillation to extend generative adversarial networks (GANs) for continual learning of image generation tasks under various conditional inputs, such as labels and images.
In contrast to traditional deep learning models, which require concurrent access to all training data (increasing storage demands and potentially breaching data privacy), Lifelong GAN operates with access only to the training data of the current task. This sidesteps memory replay, which is typically limited to label-conditioned tasks, and enables conditional image generation frameworks to handle image-conditioned tasks as well.
Contributions and Methodology
The paper's contributions are threefold:
- Generic Framework Proposal: The authors introduce a framework capable of handling both image-conditioned and label-conditioned generative tasks by leveraging knowledge distillation, a technique originally developed to transfer knowledge from a trained teacher network to a student classifier, here used to transfer learned knowledge between successive model iterations.
- Validation on Conditional Inputs: Through comprehensive experiments, Lifelong GAN demonstrates robust performance in both image and label scenarios, suggesting its effectiveness in preserving task-specific knowledge without degradation due to sequential task learning.
- Demonstration of Generality: Lifelong GAN is shown to function across diverse data domains, reinforcing its adaptability and robustness in changing task environments.
Technical Insights
The underlying model, BicycleGAN, is extended to support continual learning through knowledge distillation: the network being trained on the current task is encouraged to mimic the outputs of the previously trained model on auxiliary data generated by operations such as Montage and Swap, thereby preventing catastrophic forgetting without storing past training data.
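The auxiliary-data idea can be illustrated with a minimal sketch. The snippet below is not the paper's implementation: images are plain NumPy arrays, the patch size and grid layout of the montage are assumptions, and the function names are hypothetical. It only conveys the flavor of the two operations: Montage tiles randomly sampled patches from current-task images into a new image, and Swap exchanges the roles of the conditional and target images.

```python
import numpy as np

def montage(images, rng, grid=2):
    """Build one auxiliary image by tiling randomly cropped patches
    from current-task images into a grid (a sketch of the Montage
    operation; patch size and grid layout are assumptions)."""
    h, w, c = images[0].shape
    ph, pw = h // grid, w // grid
    out = np.zeros_like(images[0])
    for i in range(grid):
        for j in range(grid):
            src = images[rng.integers(len(images))]   # pick a source image
            y = rng.integers(0, h - ph + 1)           # random crop position
            x = rng.integers(0, w - pw + 1)
            out[i*ph:(i+1)*ph, j*pw:(j+1)*pw] = src[y:y+ph, x:x+pw]
    return out

def swap(cond, target):
    """Exchange the conditional and target images to obtain another
    auxiliary input (a sketch of the Swap operation)."""
    return target, cond
```

Auxiliary images like these are fed to both the frozen previous model and the current model, and the distillation loss penalizes disagreement between their outputs.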
A critical aspect of Lifelong GAN is balancing the competing objectives of learning new tasks and retaining knowledge of old ones. The distillation term aligns the outputs of the current model with those of the frozen previous model, so the network can learn the new task without degrading its performance on earlier tasks.
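The balance between the two objectives can be sketched as a weighted sum of a current-task term and a distillation term. The L1 form and the weight `lam` below are illustrative assumptions, not the paper's exact objective (which also includes adversarial and latent-regression terms); the point is only that one term pulls toward the new task while the other ties the model to its frozen predecessor.

```python
import numpy as np

def l1(a, b):
    """Mean absolute difference between two arrays."""
    return float(np.mean(np.abs(a - b)))

def lifelong_loss(curr_out, target, curr_aux_out, prev_aux_out, lam=10.0):
    """Combine a current-task reconstruction term with a distillation
    term that keeps the current generator's outputs on auxiliary data
    close to the frozen previous generator's outputs (hedged sketch;
    `lam` trades off plasticity against stability)."""
    task_term = l1(curr_out, target)               # learn the new task
    distill_term = l1(curr_aux_out, prev_aux_out)  # remember old tasks
    return task_term + lam * distill_term
```

When the current model perfectly matches the previous model on auxiliary data, the distillation term vanishes and only the new-task term drives learning; a larger `lam` biases training toward preserving old behavior.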
Experimental Results
The experimental evaluations span both conditioning modes: image-conditioned tasks such as image-to-image translation, and label-conditioned tasks such as digit generation on MNIST and image generation on the Flower dataset. For label-conditioned tasks, Lifelong GAN achieved comparable or superior performance relative to existing replay-based mechanisms, notably on classification accuracy and reverse classification accuracy.
Implications and Future Directions
The authors discuss the implications of their work on both practical and theoretical fronts. Practically, Lifelong GAN's approach is significant for applications like dynamic data environments where memory constraints or privacy concerns limit retraining on previous data. Theoretically, this opens avenues for the development of more sophisticated learning mechanisms that mimic biological continual learning processes.
One possible future development is refining the auxiliary data generation mechanisms to further enhance model stability and performance during continual learning. Additionally, exploring more complex conditional signals could broaden Lifelong GAN's applicability and versatility in real-world scenarios.
In sum, the Lifelong GAN framework offers a valuable advancement in generative model learning by deftly combining knowledge distillation and generative adversarial learning to address the persistent issue of catastrophic forgetting within continual learning environments.