MagicClay: Sculpting Meshes With Generative Neural Fields (2403.02460v4)
Abstract: Recent developments in neural fields have brought phenomenal capabilities to the field of shape generation, but they lack crucial properties such as incremental control, a fundamental requirement for artistic work. Triangular meshes, on the other hand, are the representation of choice for most geometry-related tasks, offering efficiency and intuitive control, but they do not lend themselves to neural optimization. To support downstream tasks, prior work typically proposes a two-step approach, where a shape is first generated using neural fields and a mesh is then extracted for further processing. Instead, in this paper we introduce a hybrid approach that consistently maintains both a mesh and a Signed Distance Field (SDF) representation. Using this representation, we introduce MagicClay, an artist-friendly tool for sculpting regions of a mesh according to textual prompts while keeping other regions untouched. Our framework carefully and efficiently balances consistency between the representations and regularizations in every step of the shape optimization. Relying on the mesh representation, we show how to render the SDF at higher resolutions and faster. In addition, we employ recent work in differentiable mesh reconstruction to adaptively allocate triangles in the mesh where required, as indicated by the SDF. Using an implemented prototype, we demonstrate superior generated geometry compared to the state-of-the-art, and novel consistent control, allowing sequential prompt-based edits to the same mesh for the first time.
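The abstract does not spell out how consistency between the mesh and the SDF is measured. As a rough illustration only, one simple way to express the idea is to penalize the SDF values sampled at the mesh vertices, since a mesh that lies on the zero level set should see values of zero there. The sketch below is an assumption and not the paper's formulation: the analytic `sphere_sdf` stands in for a learned neural SDF, and `consistency_loss` is a hypothetical helper name.

```python
import math

def sphere_sdf(p, radius=1.0):
    """Signed distance from point p to a sphere centered at the origin.

    Stand-in for a neural SDF (assumption for illustration).
    """
    return math.sqrt(sum(c * c for c in p)) - radius

def consistency_loss(vertices, sdf):
    """Mean squared SDF value at the mesh vertices.

    The value is zero when every vertex lies on the SDF's zero level set.
    """
    return sum(sdf(v) ** 2 for v in vertices) / len(vertices)

# Octahedron whose six vertices lie exactly on the unit sphere.
octahedron = [
    (1.0, 0.0, 0.0), (-1.0, 0.0, 0.0),
    (0.0, 1.0, 0.0), (0.0, -1.0, 0.0),
    (0.0, 0.0, 1.0), (0.0, 0.0, -1.0),
]

# The same mesh scaled by 1.5, so it no longer agrees with the SDF.
inflated = [tuple(1.5 * c for c in v) for v in octahedron]

aligned_loss = consistency_loss(octahedron, sphere_sdf)
inflated_loss = consistency_loss(inflated, sphere_sdf)
print(aligned_loss, inflated_loss)
```

In an optimization setting, a term of this kind would be differentiated with respect to the mesh vertices, the SDF parameters, or both. Which of these the authors update, and with what weighting, is not stated in the abstract.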