Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

ControlVAE: Controllable Variational Autoencoder (2004.05988v5)

Published 13 Apr 2020 in cs.LG and stat.ML

Abstract: Variational Autoencoders (VAE) and their variants have been widely used in a variety of applications, such as dialog generation, image generation and disentangled representation learning. However, the existing VAE models have some limitations in different applications. For example, a VAE easily suffers from KL vanishing in LLMing and low reconstruction quality for disentangling. To address these issues, we propose a novel controllable variational autoencoder framework, ControlVAE, that combines a controller, inspired by automatic control theory, with the basic VAE to improve the performance of resulting generative models. Specifically, we design a new non-linear PI controller, a variant of the proportional-integral-derivative (PID) control, to automatically tune the hyperparameter (weight) added in the VAE objective using the output KL-divergence as feedback during model training. The framework is evaluated using three applications; namely, LLMing, disentangled representation learning, and image generation. The results show that ControlVAE can achieve better disentangling and reconstruction quality than the existing methods. For LLMling, it not only averts the KL-vanishing, but also improves the diversity of generated text. Finally, we also demonstrate that ControlVAE improves the reconstruction quality of generated images compared to the original VAE.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Huajie Shao (29 papers)
  2. Shuochao Yao (18 papers)
  3. Dachun Sun (12 papers)
  4. Aston Zhang (48 papers)
  5. Shengzhong Liu (23 papers)
  6. Dongxin Liu (13 papers)
  7. Jun Wang (991 papers)
  8. Tarek Abdelzaher (58 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.