Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Learning Versatile 3D Shape Generation with Improved AR Models (2303.14700v1)

Published 26 Mar 2023 in cs.CV

Abstract: Auto-Regressive (AR) models have achieved impressive results in 2D image generation by modeling joint distributions in the grid space. While this approach has been extended to the 3D domain for powerful shape generation, it still has two limitations: expensive computations on volumetric grids and ambiguous auto-regressive order along grid dimensions. To overcome these limitations, we propose the Improved Auto-regressive Model (ImAM) for 3D shape generation, which applies discrete representation learning based on a latent vector instead of volumetric grids. Our approach not only reduces computational costs but also preserves essential geometric details by learning the joint distribution in a more tractable order. Moreover, thanks to the simplicity of our model architecture, we can naturally extend it from unconditional to conditional generation by concatenating various conditioning inputs, such as point clouds, categories, images, and texts. Extensive experiments demonstrate that ImAM can synthesize diverse and faithful shapes of multiple categories, achieving state-of-the-art performance.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Simian Luo (9 papers)
  2. Xuelin Qian (31 papers)
  3. Yanwei Fu (200 papers)
  4. Yinda Zhang (68 papers)
  5. Ying Tai (88 papers)
  6. Zhenyu Zhang (250 papers)
  7. Chengjie Wang (178 papers)
  8. Xiangyang Xue (169 papers)
Citations (3)

Summary

We haven't generated a summary for this paper yet.