Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis (2204.06307v1)

Published 13 Apr 2022 in cs.CV

Abstract: 3D-aware image synthesis aims to generate images of objects from multiple views by learning a 3D representation. However, one key challenge remains: existing approaches lack geometry constraints, hence usually fail to generate multi-view consistent images. To address this challenge, we propose Multi-View Consistent Generative Adversarial Networks (MVCGAN) for high-quality 3D-aware image synthesis with geometry constraints. By leveraging the underlying 3D geometry information of generated images, i.e., depth and camera transformation matrix, we explicitly establish stereo correspondence between views to perform multi-view joint optimization. In particular, we enforce the photometric consistency between pairs of views and integrate a stereo mixup mechanism into the training process, encouraging the model to reason about the correct 3D shape. Besides, we design a two-stage training strategy with feature-level multi-view joint optimization to improve the image quality. Extensive experiments on three datasets demonstrate that MVCGAN achieves the state-of-the-art performance for 3D-aware image synthesis.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Xuanmeng Zhang (6 papers)
  2. Zhedong Zheng (67 papers)
  3. Daiheng Gao (10 papers)
  4. Bang Zhang (33 papers)
  5. Pan Pan (24 papers)
  6. Yi Yang (856 papers)
Citations (46)

Summary

We haven't generated a summary for this paper yet.