ReconFusion: 3D Reconstruction with Diffusion Priors (2312.02981v1)
Abstract: 3D reconstruction methods such as Neural Radiance Fields (NeRFs) excel at rendering photorealistic novel views of complex scenes. However, recovering a high-quality NeRF typically requires tens to hundreds of input images, resulting in a time-consuming capture process. We present ReconFusion to reconstruct real-world scenes using only a few photos. Our approach leverages a diffusion prior for novel view synthesis, trained on synthetic and multiview datasets, which regularizes a NeRF-based 3D reconstruction pipeline at novel camera poses beyond those captured by the set of input images. Our method synthesizes realistic geometry and texture in underconstrained regions while preserving the appearance of observed regions. We perform an extensive evaluation across various real-world datasets, including forward-facing and 360-degree scenes, demonstrating significant performance improvements over previous few-view NeRF reconstruction approaches.
- Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields. CVPR, 2022.
- Zip-NeRF: Anti-aliased grid-based neural radiance fields. ICCV, 2023.
- InstructPix2Pix: Learning to Follow Image Editing Instructions. CVPR, 2023.
- pi-gan: Periodic implicit generative adversarial networks for 3d-aware image synthesis. CVPR, 2021.
- Efficient geometry-aware 3D generative adversarial networks. CVPR, 2022.
- GeNVS: Generative novel view synthesis with 3D-aware diffusion models. arXiv, 2023.
- ShapeNet: An Information-Rich 3D Model Repository. arXiv, 2015.
- Two deterministic half-quadratic regularization algorithms for computed imaging. ICIP, 1994.
- MVSNeRF: Fast Generalizable Radiance Field Reconstruction from Multi-View Stereo. ICCV, 2021.
- Objaverse: A Universe of Annotated 3D Objects. CVPR, 2023.
- Depth-supervised NeRF: Fewer Views and Faster Training for Free. CVPR, 2022.
- Deepstereo: Learning to predict new views from the world’s imagery. CVPR, 2016.
- 3d shape induction from 2d views of multiple objects. 3DV, 2017.
- Get3d: A generative model of high quality 3d textured shapes learned from images. NeurIPS, 2022.
- Generative adversarial nets. NIPS, 2014.
- NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion. ICML, 2023.
- SparseNeRF: Distilling Depth Ranking for Few-shot Novel View Synthesis. ICCV, 2023.
- Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions. ICCV, 2023.
- Escaping plato’s cave: 3d shape from adversarial rendering. ICCV, 2019.
- Unsupervised Learning of 3D Object Categories From Videos in the Wild. ICCV, 2021.
- Denoising Diffusion Probabilistic Models. NeurIPS, 2020.
- Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis. ICCV, 2021.
- Large Scale Multi-view Stereopsis Evaluation. CVPR, 2014.
- Holofusion: Towards photo-realistic 3d generative modeling. ICCV, 2023a.
- Holodiffusion: Training a 3d diffusion model using 2d images. CVPR, 2023b.
- 3D Gaussian Splatting for Real-Time Radiance Field Rendering. SIGGRAPH, 2023.
- GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency. ICML, 2023.
- Magic3D: High-Resolution Text-to-3D Content Creation. CVPR, 2023.
- Zero-1-to-3: Zero-Shot One Image to 3D Object. arXiv, 2023a.
- SyncDreamer: Generating Multiview-consistent Images from a Single-view Image. arXiv, 2023b.
- Local Light Field Fusion: Practical View Synthesis with Prescriptive Sampling Guidelines. SIGGRAPH, 2019.
- NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis. ECCV, 2020.
- Hologan: Unsupervised learning of 3d representations from natural images. ICCV, 2019.
- Giraffe: Representing scenes as compositional generative neural feature fields. CVPR, 2021.
- RegNeRF: Regularizing Neural Radiance Fields for View Synthesis from Sparse Inputs. CVPR, 2022.
- DreamFusion: Text-to-3D using 2D Diffusion. ICLR, 2022.
- Learning Transferable Visual Models From Natural Language Supervision. ICML, 2021.
- Common Objects in 3D: Large-Scale Learning and Evaluation of Real-life 3D Category Reconstruction. ICCV, 2021.
- Look Outside the Room: Synthesizing A Consistent Long-Term 3D Scene Video from A Single Image. CVPR, 2022.
- Dense Depth Priors for Neural Radiance Fields from Sparse Input Views. CVPR, 2022.
- GANeRF: Leveraging Discriminators to Optimize Neural Radiance Fields. arXiv preprint arXiv:2306.06044, 2023.
- High-Resolution Image Synthesis with Latent Diffusion Models. CVPR, 2022.
- Scene Representation Transformer: Geometry-Free Novel View Synthesis Through Set-Latent Scene Representations. CVPR, 2022.
- ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image. arXiv:2310.17994, 2023.
- Voxgraf: Fast 3d-aware image synthesis with sparse voxel grids. NeurIPS, 2022.
- MixNeRF: Modeling a Ray with Mixture Density for Novel View Synthesis from Sparse Inputs. CVPR, 2023.
- MVDream: Multi-view Diffusion for 3D Generation. arXiv, 2023.
- ViP-NeRF: Visibility Prior for Sparse Input Neural Radiance Fields. SIGGRAPH, 2023.
- SimpleNeRF: Regularizing Sparse Input Neural Radiance Fields with Simpler Solutions. SIGGRAPH Asia, 2023.
- Denoising Diffusion Implicit Models. ICLR, 2020.
- Generalizable patch-based neural rendering. ECCV, 2022.
- Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision. arXiv, 2023.
- GRF: Learning a General Radiance Field for 3D Representation and Rendering. ICCV, 2021.
- Consistent View Synthesis with Pose-Guided Diffusion Models. CVPR, 2023.
- Single-View View Synthesis With Multiplane Images. CVPR, 2020.
- Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation. CVPR, 2023.
- IBRNet: Learning Multi-View Image-Based Rendering. CVPR, 2021.
- Nerfbusters: Removing Ghostly Artifacts from Casually Captured NeRFs. ICCV, 2023.
- Novel View Synthesis with Diffusion Models. ICLR, 2022.
- DiffusioNeRF: Regularizing neural radiance fields with denoising diffusion models. CVPR, 2023.
- FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization. CVPR, 2023.
- DreamSparse: Escaping from Plato’s Cave with 2D Diffusion Model Given Sparse Views. arXiv, 2023.
- pixelNeRF: Neural Radiance Fields from One or Few Images. CVPR, 2021.
- MVImgNet: A Large-scale Dataset of Multi-view Images. CVPR, 2023.
- The Unreasonable Effectiveness of Deep Features as a Perceptual Metric. CVPR, 2018.
- Stereo Magnification: Learning View Synthesis using Multiplane Images. SIGGRAPH, 2018.
- SparseFusion: Distilling View-conditioned Diffusion for 3D Reconstruction. CVPR, 2023.
- Visual object networks: Image generation with disentangled 3d representations. NeurIPS, 2018.
- Rundi Wu (15 papers)
- Ben Mildenhall (41 papers)
- Philipp Henzler (18 papers)
- Keunhong Park (8 papers)
- Ruiqi Gao (44 papers)
- Daniel Watson (8 papers)
- Pratul P. Srinivasan (38 papers)
- Dor Verbin (21 papers)
- Jonathan T. Barron (89 papers)
- Ben Poole (46 papers)
- Aleksander Holynski (37 papers)