Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

PixelVAE++: Improved PixelVAE with Discrete Prior (1908.09948v1)

Published 26 Aug 2019 in cs.CV, cs.LG, and stat.ML

Abstract: Constructing powerful generative models for natural images is a challenging task. PixelCNN models capture details and local information in images very well but have limited receptive field. Variational autoencoders with a factorial decoder can capture global information easily, but they often fail to reconstruct details faithfully. PixelVAE combines the best features of the two models and constructs a generative model that is able to learn local and global structures. Here we introduce PixelVAE++, a VAE with three types of latent variables and a PixelCNN++ for the decoder. We introduce a novel architecture that reuses a part of the decoder as an encoder. We achieve the state of the art performance on binary data sets such as MNIST and Omniglot and achieve the state of the art performance on CIFAR-10 among latent variable models while keeping the latent variables informative.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Hossein Sadeghi (10 papers)
  2. Evgeny Andriyash (16 papers)
  3. Walter Vinci (31 papers)
  4. Lorenzo Buffoni (40 papers)
  5. Mohammad H. Amin (28 papers)
Citations (33)

Summary

We haven't generated a summary for this paper yet.