Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Multi-objects Generation with Amortized Structural Regularization (1906.03923v1)

Published 10 Jun 2019 in cs.LG and stat.ML

Abstract: Deep generative models (DGMs) have shown promise in image generation. However, most of the existing work learn the model by simply optimizing a divergence between the marginal distributions of the model and the data, and often fail to capture the rich structures and relations in multi-object images. Human knowledge is a critical element to the success of DGMs to infer these structures. In this paper, we propose the amortized structural regularization (ASR) framework, which adopts the posterior regularization (PR) to embed human knowledge into DGMs via a set of structural constraints. We derive a lower bound of the regularized log-likelihood, which can be jointly optimized with respect to the generative model and recognition model efficiently. Empirical results show that ASR significantly outperforms the DGM baselines in terms of inference accuracy and sample quality.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Kun Xu (277 papers)
  2. Chongxuan Li (75 papers)
  3. Jun Zhu (424 papers)
  4. Bo Zhang (633 papers)
Citations (17)

Summary

We haven't generated a summary for this paper yet.