Adversarial Variational Bayes: Unifying Variational Autoencoders and Generative Adversarial Networks (1701.04722v4)

Published 17 Jan 2017 in cs.LG

Abstract: Variational Autoencoders (VAEs) are expressive latent variable models that can be used to learn complex probability distributions from training data. However, the quality of the resulting model crucially relies on the expressiveness of the inference model. We introduce Adversarial Variational Bayes (AVB), a technique for training Variational Autoencoders with arbitrarily expressive inference models. We achieve this by introducing an auxiliary discriminative network that allows to rephrase the maximum-likelihood-problem as a two-player game, hence establishing a principled connection between VAEs and Generative Adversarial Networks (GANs). We show that in the nonparametric limit our method yields an exact maximum-likelihood assignment for the parameters of the generative model, as well as the exact posterior distribution over the latent variables given an observation. Contrary to competing approaches which combine VAEs with GANs, our approach has a clear theoretical justification, retains most advantages of standard Variational Autoencoders and is easy to implement.

Authors (3)
  1. Lars Mescheder (12 papers)
  2. Sebastian Nowozin (45 papers)
  3. Andreas Geiger (136 papers)
Citations (515)

Summary

Adversarial Variational Bayes: Integrating VAEs and GANs

The paper "Adversarial Variational Bayes: Unifying Variational Autoencoders and Generative Adversarial Networks" introduces a novel approach called Adversarial Variational Bayes (AVB) which integrates Variational Autoencoders (VAEs) with Generative Adversarial Networks (GANs). This paper enables the training of VAEs using highly expressive inference models, which has been a limitation in traditional methodologies.

Core Contributions

The primary contribution of this paper is the introduction of an auxiliary discriminative network into VAE training. This allows the maximum-likelihood problem to be reformulated as a two-player game, establishing a principled connection between VAEs and GANs. Key contributions of the paper include:

  • Expressive Inference Models: AVB uses adversarial training to admit inference models of arbitrary complexity, including implicit models whose density cannot be evaluated in closed form, enabling better approximation of complex posterior distributions (see the sketch after this list).
  • Theoretical Justification: The authors show that in the nonparametric limit, AVB yields the exact maximum-likelihood assignment for the generative-model parameters and the exact posterior distribution over the latent variables.
  • Ease of Implementation: Unlike other approaches that combine VAEs and GANs, AVB retains most advantages of standard VAEs and is easy to implement.
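
As a concrete illustration, below is a minimal PyTorch sketch of one AVB training step in the spirit of the paper's alternating scheme. The architectures, the dimensions `X_DIM`, `Z_DIM`, and `EPS_DIM`, the learning rates, and the Bernoulli decoder are illustrative assumptions for binarized inputs, not the paper's exact configuration:

```python
# Minimal AVB training step (sketch). Sizes and architectures are
# hypothetical choices, not those used in the paper.
import torch
import torch.nn as nn
import torch.nn.functional as F

X_DIM, Z_DIM, EPS_DIM = 784, 32, 32  # assumed dimensions

# Implicit inference model: a deterministic network of (x, eps) with
# eps ~ N(0, I); its density q(z|x) is never evaluated explicitly.
encoder = nn.Sequential(nn.Linear(X_DIM + EPS_DIM, 256), nn.ReLU(),
                        nn.Linear(256, Z_DIM))
# Generative model p(x|z): Bernoulli decoder over pixels (logits).
decoder = nn.Sequential(nn.Linear(Z_DIM, 256), nn.ReLU(),
                        nn.Linear(256, X_DIM))
# Discriminator T(x, z): separates (x, z ~ q(z|x)) from (x, z ~ p(z)).
disc = nn.Sequential(nn.Linear(X_DIM + Z_DIM, 256), nn.ReLU(),
                     nn.Linear(256, 1))

opt_vae = torch.optim.Adam(
    list(encoder.parameters()) + list(decoder.parameters()), lr=1e-4)
opt_disc = torch.optim.Adam(disc.parameters(), lr=1e-4)

def avb_step(x):
    batch = x.size(0)
    eps = torch.randn(batch, EPS_DIM)
    z_q = encoder(torch.cat([x, eps], dim=1))  # sample from q(z|x)
    z_p = torch.randn(batch, Z_DIM)            # sample from the prior p(z)

    # 1) Discriminator update: logistic loss, label 1 for posterior pairs.
    #    z_q is detached so this step does not update the encoder.
    t_q = disc(torch.cat([x, z_q.detach()], dim=1))
    t_p = disc(torch.cat([x, z_p], dim=1))
    loss_d = (F.binary_cross_entropy_with_logits(t_q, torch.ones_like(t_q))
              + F.binary_cross_entropy_with_logits(t_p, torch.zeros_like(t_p)))
    opt_disc.zero_grad(); loss_d.backward(); opt_disc.step()

    # 2) VAE update: T(x, z) stands in for log q(z|x) - log p(z), so the
    #    negative ELBO is T(x, z) minus the decoder log-likelihood.
    t_q = disc(torch.cat([x, z_q], dim=1))
    logits = decoder(z_q)
    rec = F.binary_cross_entropy_with_logits(
        logits, x, reduction='none').sum(1)
    loss_vae = (t_q.squeeze(1) + rec).mean()
    opt_vae.zero_grad(); loss_vae.backward(); opt_vae.step()
    return loss_d.item(), loss_vae.item()
```

The alternation mirrors the method's structure: the discriminator is first fit against the current inference model, and the encoder and decoder are then updated with the discriminator standing in for the intractable log-density ratio.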

Theoretical Framework

The research recasts the VAE optimization problem in adversarial terms. A discriminative network $T(x,z)$ is introduced, turning the problem into a game in which the discriminator tries to distinguish pairs $(x,z)$ with $z$ drawn from the approximate posterior $q_\phi(z \mid x)$ from pairs with $z$ drawn from the prior $p(z)$. The crucial finding is that the optimal discriminator $T^*(x,z)$ equals the log-density ratio $\log q_\phi(z \mid x) - \log p(z)$, which is exactly the KL term of the evidence lower bound that is otherwise intractable for implicit inference models.
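
In symbols (restating the paper's central result, with $p_{\mathcal{D}}(x)$ the data distribution and $\sigma$ the sigmoid):

$$
T^* = \arg\max_T \; \mathbb{E}_{p_{\mathcal{D}}(x)}\Big[\,\mathbb{E}_{q_\phi(z\mid x)}\log \sigma\big(T(x,z)\big) + \mathbb{E}_{p(z)}\log\big(1-\sigma(T(x,z))\big)\Big],
\qquad
T^*(x,z) = \log q_\phi(z\mid x) - \log p(z).
$$

Substituting $T^*$ into the evidence lower bound gives the objective actually optimized by the encoder and decoder:

$$
\mathcal{L}(\theta,\phi) = \mathbb{E}_{p_{\mathcal{D}}(x)}\,\mathbb{E}_{q_\phi(z\mid x)}\big[\,-T^*(x,z) + \log p_\theta(x\mid z)\big].
$$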

Empirical Evaluation

The paper presents strong empirical evidence for the efficacy of AVB, including:

  • Variational Inference: The method outperforms traditional Gaussian inference models at capturing complex posterior distributions, as evaluated on the "Eight Schools" example.
  • Generative Modeling: On the MNIST dataset, AVB achieved log-likelihoods competitive with state-of-the-art models while generating realistic samples.

Implications and Future Directions

The integration of highly expressive inference models with a principled adversarial approach has significant implications for generative-model development. It addresses the limited expressiveness of traditional VAE inference models and paves the way for applying AVB to a variety of unsupervised learning tasks.

Future research could explore:

  • Architectural Innovations: Developing neural architectures that can better approximate the optimal discriminator could enhance learning and stability.
  • Extending AVB: Applying AVB to more diverse datasets or incorporating it with other generative paradigms may offer new insights and advancements in AI research.
  • Optimization Strategies: Investigating alternative optimization techniques to improve the convergence properties of the adversarial training framework.

In conclusion, Adversarial Variational Bayes represents a significant theoretical and practical step toward merging the strengths of VAEs and GANs. By improving both inference precision and generative quality within a principled framework, this research opens multiple avenues for future exploration in generative modeling and unsupervised learning.