PolyDiff: Generating 3D Polygonal Meshes with Diffusion Models (2312.11417v1)
Abstract: We introduce PolyDiff, the first diffusion-based approach capable of directly generating realistic and diverse 3D polygonal meshes. In contrast to methods that use alternate 3D shape representations (e.g. implicit representations), our approach is a discrete denoising diffusion probabilistic model that operates natively on the polygonal mesh data structure. This enables learning of both the geometric properties of vertices and the topological characteristics of faces. Specifically, we treat meshes as quantized triangle soups, progressively corrupted with categorical noise in the forward diffusion phase. In the reverse diffusion phase, a transformer-based denoising network is trained to revert the noising process, restoring the original mesh structure. At inference, new meshes can be generated by applying this denoising network iteratively, starting with a completely noisy triangle soup. Consequently, our model is capable of producing high-quality 3D polygonal meshes, ready for integration into downstream 3D workflows. Our extensive experimental analysis shows that PolyDiff achieves a significant advantage (avg. FID and JSD improvement of 18.2 and 5.8 respectively) over current state-of-the-art methods.
- Learning representations and generative models for 3d point clouds. In ICML, 2018.
- Structured denoising diffusion models in discrete state-spaces. In NeurIPS, 2021.
- All are worth words: A vit backbone for diffusion models. In CVPR, 2023.
- ShapeNet: An Information-Rich 3D Model Repository. preprint arXiv:1512.03012, 2015.
- PolyDiffuse: Polygonal Shape Reconstruction via Guided Set Diffusion Models. arXiv preprint arXiv:2306.01461, 2023.
- Learning implicit fields for generative shape modeling. In CVPR, 2019.
- Bsp-net: Generating compact meshes via binary space partitioning. In CVPR, 2020.
- Imagenet: A large-scale hierarchical image database. In CVPR, 2009.
- Diffusion models beat GANs on image synthesis. In NeurIPS, 2021.
- An image is worth 16x16 words: Transformers for image recognition at scale. In ICLR, 2021.
- A point set generation network for 3d object reconstruction from a single image. In CVPR, 2017.
- Learning deformable tetrahedral meshes for 3d reconstruction. In NeurIPS, 2020.
- Generative adversarial nets. In NeurIPS, 2014.
- AtlasNet: A Papier-Mâché Approach to Learning 3D Surface Generation. In CVPR, 2018.
- Gans trained by a two time-scale update rule converge to a local nash equilibrium. In NeurIPS, 2017.
- Denoising diffusion probabilistic models. In NeurIPS, 2020.
- Equivariant diffusion for molecule generation in 3d. In ICLR, 2022.
- Elucidating the design space of diffusion-based generative models. In NeurIPS, 2022.
- Auto-Encoding Variational Bayes. In ICLR, 2014.
- MeshDiffusion: Score-based generative 3d mesh modeling. In ICLR, 2023.
- Marching cubes: A high resolution 3d surface construction algorithm. In SIGGRAPH, 1987.
- Decoupled weight decay regularization. In ICLR, 2019.
- Diffusion probabilistic models for 3d point cloud generation. In CVPR, 2021.
- PolyGen: An autoregressive generative model of 3D meshes. In ICML, 2020.
- Improved denoising diffusion probabilistic models. ArXiv preprint, 2021.
- State of the art on diffusion models for visual computing. arXiv preprint arXiv:2310.07204, 2023.
- High-resolution image synthesis with latent diffusion models. In CVPR, 2022.
- Housediffusion: Vector floorplan generation via a diffusion model with discrete and continuous denoising. In CVPR, 2023.
- Diffusion-based signed distance fields for 3d shape generation. In CVPR, 2023.
- Deep unsupervised learning using nonequilibrium thermodynamics. In ICML, 2015.
- Consistency models. arXiv preprint arXiv:2303.01469, 2023.
- Digress: Discrete denoising diffusion for graph generation. In ICLR, 2023.
- Learning a probabilistic latent space of object shapes via 3d generative-adversarial modeling. In NeurIPS, 2016.
- LION: Latent point diffusion models for 3d shape generation. In NeurIPS, 2022.
- 3d shape generation and completion through point-voxel diffusion. In CVPR, 2021.