CombiNeRF: A Combination of Regularization Techniques for Few-Shot Neural Radiance Field View Synthesis (2403.14412v1)
Abstract: Neural Radiance Fields (NeRFs) have shown impressive results for novel view synthesis when a sufficiently large amount of views are available. When dealing with few-shot settings, i.e. with a small set of input views, the training could overfit those views, leading to artifacts and geometric and chromatic inconsistencies in the resulting rendering. Regularization is a valid solution that helps NeRF generalization. On the other hand, each of the most recent NeRF regularization techniques aim to mitigate a specific rendering problem. Starting from this observation, in this paper we propose CombiNeRF, a framework that synergically combines several regularization techniques, some of them novel, in order to unify the benefits of each. In particular, we regularize single and neighboring rays distributions and we add a smoothness term to regularize near geometries. After these geometric approaches, we propose to exploit Lipschitz regularization to both NeRF density and color networks and to use encoding masks for input features regularization. We show that CombiNeRF outperforms the state-of-the-art methods with few-shot settings in several publicly available datasets. We also present an ablation study on the LLFF and NeRF-Synthetic datasets that support the choices made. We release with this paper the open-source implementation of our framework.
- Learning neural light fields with ray-space embedding networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
- Self-nerf: A self-training pipeline for few-shot neural radiance fields, 2023.
- Mip-nerf 360: Unbounded anti-aliased neural radiance fields. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5470–5479, 2022.
- GeNVS: Generative novel view synthesis with 3D-aware diffusion models. In arXiv, 2023.
- Aug-nerf: Training stronger neural radiance fields with triple-level physically-grounded augmentations. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
- Sparsenerf: Distilling depth ranking for few-shot novel view synthesis. IEEE/CVF International Conference on Computer Vision (ICCV), 2023.
- Nerfren: Neural radiance fields with reflections. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 18409–18418, 2022.
- Consistentnerf: Enhancing neural radiance fields with 3d consistency for sparse view synthesis, 2023.
- Putting nerf on a diet: Semantically consistent few-shot view synthesis. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 5885–5894, 2021.
- Infonerf: Ray entropy minimization for few-shot neural volume rendering. In CVPR, 2022.
- Neuralangelo: High-fidelity neural surface reconstruction. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
- Neuralblox: Real-time neural representation fusion for robust volumetric mapping. 2021 International Conference on 3D Vision (3DV), 2021.
- Learning smooth neural functions via lipschitz regularization. Special Interest Group on Computer Graphics and Interactive Techniques Conference Proceedings, 2022.
- Neural volumes: Learning dynamic renderable volumes from images. ACM Trans. Graph., 38(4):65:1–65:14, 2019.
- Local light field fusion: Practical view synthesis with prescriptive sampling guidelines. ACM Transactions on Graphics (TOG), 38(4):1–14, 2019.
- Nerf: Representing scenes as neural radiance fields for view synthesis. Communications of the ACM, 65(1):99–106, 2021.
- NeRF in the dark: High dynamic range view synthesis from noisy raw images. CVPR, 2022.
- Instant neural graphics primitives with a multiresolution hash encoding. ACM Trans. Graph., 41(4):102:1–102:15, 2022.
- DONeRF: Towards Real-Time Rendering of Compact Neural Radiance Fields using Depth Oracle Networks. Computer Graphics Forum, 40(4), 2021.
- Rgbd-net: Predicting color and depth images for novel views synthesis. In Proceedings of the International Conference on 3D Vision, 2021.
- Campari: Camera-aware decomposed generative neural radiance fields. In International Conference on 3D Vision (3DV), 2021.
- Differentiable volumetric rendering: Learning implicit 3d representations without 3d supervision. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
- Regnerf: Regularizing neural radiance fields for view synthesis from sparse inputs. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2022.
- Terminerf: Ray termination prediction for efficient neural rendering. In 2021 International Conference on 3D Vision (3DV), pages 1106–1114, Los Alamitos, CA, USA, 2021. IEEE Computer Society.
- Cross-spectral neural radiance fields. In Proceedings of the International Conference on 3D Vision, 2022. 3DV.
- D-NeRF: Neural Radiance Fields for Dynamic Scenes. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.
- Permutosdf: Fast multi-view reconstruction with implicit surfaces using permutohedral lattices. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
- Plenoxels: Radiance fields without neural networks. In CVPR, 2022.
- Lipschitz regularity of deep neural networks: analysis and efficient estimation. In Neural Information Processing Systems, 2018.
- Novel view synthesis of dynamic scenes with globally coherent depths from a monocular camera. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
- Deepvoxels: Learning persistent 3d feature embeddings. In Proc. Computer Vision and Pattern Recognition (CVPR), IEEE, 2019.
- Harnessing low-frequency neural fields for few-shot view synthesis, 2023.
- Block-nerf: Scalable large scene neural view synthesis. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 8238–8248, Los Alamitos, CA, USA, 2022. IEEE Computer Society.
- Jiaxiang Tang. Torch-ngp: A pytorch implementation of instant-ngp, 2022.
- Ref-NeRF: Structured view-dependent appearance for neural radiance fields. CVPR, 2022.
- Go-surf: Neural feature grid optimization for fast, high-fidelity rgb-d surface reconstruction. 2022 International Conference on 3D Vision (3DV), 2022a.
- Neus: Learning neural implicit surfaces by volume rendering for multi-view reconstruction. CoRR, abs/2106.10689, 2021.
- F2-nerf: Fast neural radiance field training with free camera trajectories. CVPR, 2023.
- Hf-neus: Improved surface reconstruction using high-frequency details. Advances in Neural Information Processing Systems, 35:1966–1978, 2022b.
- Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing, 13(4):600–612, 2004.
- Diver: Real-time and accurate neural radiance fields with deterministic integration for volume rendering. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
- DiffusioNeRF: Regularizing Neural Radiance Fields with Denoising Diffusion Models. In CVPR, 2023.
- Fig-nerf: Figure-ground neural radiance fields for 3d object category modelling. In International Conference on 3D Vision (3DV), 2021.
- Freenerf: Improving few-shot neural rendering with free frequency regularization. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2023.
- Multiview neural surface reconstruction by disentangling geometry and appearance. Advances in Neural Information Processing Systems, 33, 2020.
- pixelNeRF: Neural radiance fields from one or few images. In CVPR, 2021.
- PhySG: Inverse rendering with spherical gaussians for physics-based material editing and relighting. In The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
- Iron: Inverse rendering by optimizing neural sdfs and materials from photometric images. In IEEE Conf. Comput. Vis. Pattern Recog., 2022.
- The unreasonable effectiveness of deep features as a perceptual metric. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 586–595, 2018.
- Human performance modeling and rendering via neural animated mesh. ACM Trans. Graph., 41(6), 2022.
- Dual-space nerf: Learning animatable avatars and scene lighting in separate spaces. In International Conference on 3D Vision (3DV), 2022.