BoostMVSNeRFs: Boosting MVS-based NeRFs to Generalizable View Synthesis in Large-scale Scenes (2407.15848v1)
Abstract: While Neural Radiance Fields (NeRFs) have demonstrated exceptional quality, their protracted training duration remains a limitation. Generalizable and MVS-based NeRFs, although capable of mitigating training time, often incur tradeoffs in quality. This paper presents a novel approach called BoostMVSNeRFs to enhance the rendering quality of MVS-based NeRFs in large-scale scenes. We first identify limitations in MVS-based NeRF methods, such as restricted viewport coverage and artifacts due to limited input views. Then, we address these limitations by proposing a new method that selects and combines multiple cost volumes during volume rendering. Our method does not require training and can adapt to any MVS-based NeRF methods in a feed-forward fashion to improve rendering quality. Furthermore, our approach is also end-to-end trainable, allowing fine-tuning on specific scenes. We demonstrate the effectiveness of our method through experiments on large-scale datasets, showing significant rendering quality improvements in large-scale scenes and unbounded outdoor scenarios. We release the source code of BoostMVSNeRFs at https://su-terry.github.io/BoostMVSNeRFs/.
- Neural point-based graphics. In ECCV.
- Mip-nerf: A multiscale representation for anti-aliasing neural radiance fields. In ICCV.
- Mip-nerf 360: Unbounded anti-aliased neural radiance fields. In CVPR.
- Zip-NeRF: Anti-aliased grid-based neural radiance fields. In ICCV.
- Nerd: Neural reflectance decomposition from image collections. In ICCV.
- Fwd: Real-time novel view synthesis with forward warping and depth. In CVPR.
- Depth synthesis and local warps for plausible image-based navigation. ACM TOG (2013).
- Tensorf: Tensorial radiance fields. In ECCV.
- Mvsnerf: Fast generalizable radiance field reconstruction from multi-view stereo. In ICCV.
- Point-based multi-view stereo network. In ICCV.
- Explicit Correspondence Matching for Generalizable Neural Radiance Fields. arXiv preprint arXiv:2304.12294 (2023).
- Improving Robustness for Joint Optimization of Camera Poses and Decomposed Low-Rank Tensorial Radiance Fields. In AAAI.
- Stereo radiance fields (srf): Learning view synthesis for sparse views of novel scenes. In CVPR.
- Scannet: Richly-annotated 3d reconstructions of indoor scenes. In CVPR.
- Modeling and rendering architecture from photographs: A hybrid geometry-and image-based approach. In Seminal Graphics Papers: Pushing the Boundaries, Volume 2.
- Depth-supervised nerf: Fewer views and faster training for free. In CVPR.
- Peeking behind objects: Layered depth prediction from a single image. Pattern Recognition Letters (2019).
- Deepview: View synthesis with learned gradient descent. In CVPR.
- Deepstereo: Learning to predict new views from the world’s imagery. In CVPR.
- SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes. In CVPR.
- Layered depth images. In SIGGRAPH.
- Cascade cost volume for high-resolution multi-view stereo and stereo matching. In CVPR.
- Baking neural radiance fields for real-time view synthesis. In ICCV.
- Putting nerf on a diet: Semantically consistent few-shot view synthesis. In ICCV.
- Sdfdiff: Differentiable rendering of signed distance fields for 3d shape optimization. In CVPR.
- Geonerf: Generalizing nerf with geometry priors. In CVPR.
- Learning-based view synthesis for light field cameras. ACM TOG (2016).
- Infonerf: Ray entropy minimization for few-shot neural volume rendering. In CVPR.
- Neural 3d video synthesis from multi-view video. In CVPR.
- Crowdsampling the plenoptic function. In ECCV.
- Im4d: High-fidelity and real-time novel view synthesis for dynamic scenes. arXiv preprint arXiv:2310.08585 (2023).
- Efficient neural radiance fields for interactive free-viewpoint video. In SIGGRAPH Asia.
- Vision transformer for nerf-based view synthesis from a single input image. In WACV.
- Neural rendering and reenactment of human actor videos. ACM TOG (2019).
- Neural rays for occlusion-aware image-based rendering. In CVPR.
- Robust dynamic radiance fields. In CVPR.
- Neural volumes: Learning dynamic renderable volumes from images. ACM TOG (2019).
- Mixture of volumetric primitives for efficient neural rendering. ACM TOG (2021).
- Progressively optimized local radiance fields for robust view synthesis. In CVPR.
- Local light field fusion: Practical view synthesis with prescriptive sampling guidelines. ACM TOG (2019).
- NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis. In ECCV.
- Instant neural graphics primitives with a multiresolution hash encoding. ACM TOG (2022).
- Extracting triangular 3d models, materials, and lighting from images. In CVPR.
- An analysis of approximations for maximizing submodular set functions—I. Mathematical programming (1978).
- Regnerf: Regularizing neural radiance fields for view synthesis from sparse inputs. In CVPR.
- Unisurf: Unifying neural implicit surfaces and radiance fields for multi-view reconstruction. In ICCV.
- Nerfies: Deformable neural radiance fields. In ICCV.
- Eric Penner and Li Zhang. 2017. Soft 3d reconstruction for view synthesis. ACM TOG (2017).
- Surfels: Surface elements as rendering primitives. In Proceedings of the 27th annual conference on Computer graphics and interactive techniques.
- D-nerf: Neural radiance fields for dynamic scenes. In CVPR.
- Gernot Riegler and Vladlen Koltun. 2020. Free view synthesis. In ECCV.
- Dense depth priors for neural radiance fields from sparse input views. In CVPR.
- MixNeRF: Modeling a Ray with Mixture Density for Novel View Synthesis from Sparse Inputs. In CVPR.
- Garf: Geometry-aware generalized neural radiance field. arXiv preprint arXiv:2212.02280 (2022).
- 3d photography using context-aware layered depth inpainting. In CVPR.
- Deepvoxels: Learning persistent 3d feature embeddings. In CVPR.
- Nagabhushan Somraj and Rajiv Soundararajan. 2023. ViP-NeRF: Visibility Prior for Sparse Input Neural Radiance Fields. (2023).
- Pushing the boundaries of view extrapolation with multiplane images. In CVPR.
- Direct voxel grid optimization: Super-fast convergence for radiance fields reconstruction. In CVPR.
- Block-nerf: Scalable large scene neural view synthesis. In CVPR.
- Deferred neural rendering: Image synthesis using neural textures. ACM TOG (2019).
- Alex Trevithick and Bo Yang. 2021. Grf: Learning a general radiance field for 3d representation and rendering. In ICCV.
- Richard Tucker and Noah Snavely. 2020. Single-view view synthesis with multiplane images. In CVPR.
- Layer-structured 3d scene inference via view synthesis. In ECCV.
- SCADE: NeRFs from Space Carving with Ambiguity-Aware Depth Estimates. In CVPR.
- Let there be color! Large-scale texturing of 3D reconstructions. In ECCV.
- Sparsenerf: Distilling depth ranking for few-shot novel view synthesis. In ICCV.
- Neus: Learning neural implicit surfaces by volume rendering for multi-view reconstruction. In NeurIPS.
- F2-NeRF: Fast Neural Radiance Field Training with Free Camera Trajectories. In CVPR.
- Ibrnet: Learning multi-view image-based rendering. In CVPR.
- Image quality assessment: from error visibility to structural similarity. IEEE TIP (2004).
- Nerfingmvs: Guided optimization of neural radiance fields for indoor multi-view stereo. In ICCV.
- Nex: Real-time view synthesis with neural basis expansion. In CVPR.
- Surface light fields for 3D photography. In Seminal Graphics Papers: Pushing the Boundaries, Volume 2.
- ReconFusion: 3D Reconstruction with Diffusion Priors. arXiv preprint arXiv:2312.02981 (2023).
- Jamie Wynn and Daniyar Turmukhambetov. 2023. Diffusionerf: Regularizing neural radiance fields with denoising diffusion models. In CVPR.
- Space-time neural irradiance fields for free-viewpoint video. In CVPR.
- Point-nerf: Point-based neural radiance fields. In CVPR.
- FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization. In CVPR.
- Mvsnet: Depth inference for unstructured multi-view stereo. In ECCV.
- Recurrent mvsnet for high-resolution multi-view stereo depth inference. In CVPR.
- Volume rendering of neural implicit surfaces. In NeurIPS.
- Multiview neural surface reconstruction by disentangling geometry and appearance. In NeurIPS.
- Plenoxels: Radiance fields without neural networks. In CVPR.
- Plenoctrees for real-time rendering of neural radiance fields. In ICCV.
- pixelnerf: Neural radiance fields from one or few images. In CVPR.
- Zehao Yu and Shenghua Gao. 2020. Fast-mvsnet: Sparse-to-dense multi-view stereo with learned propagation and gauss-newton refinement. In CVPR.
- Physg: Inverse rendering with spherical gaussians for physics-based material editing and relighting. In CVPR.
- Nerf++: Analyzing and improving neural radiance fields. arXiv preprint arXiv:2010.07492 (2020).
- The unreasonable effectiveness of deep features as a perceptual metric. In CVPR.
- Nerfusion: Fusing radiance fields for large-scale scene reconstruction. In CVPR.
- Nerfactor: Neural factorization of shape and reflectance under an unknown illumination. ACM TOG (2021).
- Stereo magnification: Learning view synthesis using multiplane images. In SIGGRAPH.
- Vdn-nerf: Resolving shape-radiance ambiguity via view-dependence normalization. In CVPR.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Collections
Sign up for free to add this paper to one or more collections.