As-Plausible-As-Possible: Plausibility-Aware Mesh Deformation Using 2D Diffusion Priors (2311.16739v2)
Abstract: We present As-Plausible-as-Possible (APAP), a mesh deformation technique that leverages 2D diffusion priors to preserve the plausibility of a mesh under user-controlled deformation. Our framework uses per-face Jacobians to represent mesh deformations, and the mesh vertex coordinates are computed from them via a differentiable Poisson solve. The deformed mesh is rendered, and the resulting 2D image is used in the Score Distillation Sampling (SDS) process, which extracts meaningful plausibility priors from a pretrained 2D diffusion model. To better preserve the identity of the edited mesh, we fine-tune our 2D diffusion model with LoRA. Gradients extracted by SDS and a user-prescribed handle displacement are then backpropagated to the per-face Jacobians, and we use iterative gradient descent to compute the final deformation, which balances the user edit against the plausibility of the output. We evaluate our method on 2D and 3D meshes and demonstrate qualitative and quantitative improvements when using plausibility priors over the geometry-preservation or distortion-minimization priors used by previous techniques. Our project page is at: https://as-plausible-aspossible.github.io/
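The optimization loop described in the abstract can be illustrated with a minimal NumPy sketch on a toy 2D mesh. It is not the authors' implementation, and it makes several assumptions: vertex 0 is pinned to fix the translation of the Poisson solve, plain fixed-step gradient descent is used, and the SDS gradient from the diffusion model is replaced by a placeholder quadratic term that pulls each Jacobian toward the identity (a distortion-style prior, used here only so the code runs without a renderer or a diffusion model). All function names (`build_gradient`, `poisson_operator`, `poisson_solve`, `deform`) and the parameters `lam` and `steps` are hypothetical.

```python
import numpy as np

# Edge-difference operator of one triangle: rows give v1 - v0 and v2 - v0.
D = np.array([[-1.0, 1.0, 0.0], [-1.0, 0.0, 1.0]])

def build_gradient(V, F):
    """Stack per-face operators so that (G @ X)[2k:2k+2] equals J_k^T."""
    G = np.zeros((2 * len(F), len(V)))
    w = np.zeros(2 * len(F))
    for k, f in enumerate(F):
        B = D @ V[f]                                  # rows: the two rest-pose edges
        G[2 * k:2 * k + 2, f] = np.linalg.solve(B, D)
        w[2 * k:2 * k + 2] = 0.5 * abs(np.linalg.det(B))  # face area as weight
    return G, w

def poisson_operator(G, w):
    """Linear map from a right-hand side to the free vertices (vertex 0 pinned)."""
    Gf = G[:, 1:]
    A = Gf.T @ (w[:, None] * Gf)                      # area-weighted normal matrix
    return np.linalg.solve(A, Gf.T * w)               # Gf.T * w == Gf.T @ diag(w)

def poisson_solve(P, G, V, Jt):
    """Vertices whose face gradients best match the stacked Jacobians Jt."""
    Xf = P @ (Jt - G[:, :1] @ V[:1])                  # account for the pinned vertex
    return np.vstack([V[:1], Xf])

def deform(V, F, handle, target, lam=0.1, steps=500):
    """Gradient descent on per-face Jacobians; `handle` must not be vertex 0."""
    G, w = build_gradient(V, F)
    P = poisson_operator(G, w)
    I_stack = np.tile(np.eye(2), (len(F), 1))         # identity Jacobian per face
    Jt = I_stack.copy()
    p = P[handle - 1]                                 # row of P for the handle vertex
    lr = 1.0 / (2.0 * (p @ p + lam * w.max()))        # step from a Lipschitz bound
    for _ in range(steps):
        X = poisson_solve(P, G, V, Jt)
        r = X[handle] - target                        # handle residual
        grad = 2.0 * np.outer(p, r)                   # gradient of ||r||^2 w.r.t. Jt
        # ASSUMPTION: placeholder for the SDS gradient of the rendered mesh.
        grad += 2.0 * lam * w[:, None] * (Jt - I_stack)
        Jt -= lr * grad
    return poisson_solve(P, G, V, Jt), Jt

# Toy example: a unit square split into two triangles; drag vertex 2.
V = np.array([[0.0, 0.0], [1.0, 0.0], [1.0, 1.0], [0.0, 1.0]])
F = np.array([[0, 1, 2], [0, 2, 3]])
target = np.array([1.5, 1.2])
X, Jt = deform(V, F, handle=2, target=target)
```

Because the vertex positions depend linearly on the Jacobians through the Poisson solve, the handle gradient can be written in closed form here. With a learned prior, the same gradient would instead be obtained by backpropagating through the differentiable solve and the renderer.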