Depth-Regularized Optimization for 3D Gaussian Splatting in Few-Shot Images (2311.13398v3)
Abstract: In this paper, we present a method to optimize Gaussian splatting with a limited number of images while avoiding overfitting. Representing a 3D scene by combining numerous Gaussian splats has yielded outstanding visual quality. However, it tends to overfit the training views when only a small number of images are available. To address this issue, we introduce a dense depth map as a geometry guide to mitigate overfitting. We obtained the depth map using a pre-trained monocular depth estimation model and aligning the scale and offset using sparse COLMAP feature points. The adjusted depth aids in the color-based optimization of 3D Gaussian splatting, mitigating floating artifacts, and ensuring adherence to geometric constraints. We verify the proposed method on the NeRF-LLFF dataset with varying numbers of few images. Our approach demonstrates robust geometry compared to the original method that relies solely on images. Project page: robot0321.github.io/DepthRegGS
- Real-time rendering. Crc Press, 2019.
- Neural rgb-d surface reconstruction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022.
- Mip-nerf: A multiscale representation for anti-aliasing neural radiance fields. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021.
- Mip-nerf 360: Unbounded anti-aliased neural radiance fields. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022.
- Zoedepth: Zero-shot transfer by combining relative and metric depth. arXiv preprint arXiv:2302.12288, 2023.
- Nope-nerf: Optimising neural radiance field with no pose prior. 2023.
- Neural surface reconstruction of dynamic scenes with monocular rgb-d camera. Advances in Neural Information Processing Systems, 2022.
- John Canny. A computational approach to edge detection. IEEE Transactions on pattern analysis and machine intelligence, 1986.
- Efficient geometry-aware 3d generative adversarial networks. In CVPR, 2022.
- Tensorf: Tensorial radiance fields. In ECCV, 2022.
- Mobilenerf: Exploiting the polygon rasterization pipeline for efficient neural field rendering on mobile architectures. In CVPR, 2023.
- Texture splats for 3d scalar and vector field visualization. In Proceedings Visualization’93, 1993.
- Depth-supervised nerf: Fewer views and faster training for free. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022a.
- Fov-nerf: Foveated neural radiance fields for virtual reality. IEEE Transactions on Visualization and Computer Graphics, 2022b.
- Mip-nerf rgb-d: Depth assisted fast neural radiance fields. arXiv preprint arXiv:2205.09351, 2022.
- Plenoxels: Radiance fields without neural networks. In CVPR, 2022.
- Multi-view stereo: A tutorial. Foundations and Trends® in Computer Graphics and Vision, 2015.
- Nerf: Neural radiance field in 3d vision, a comprehensive review. arXiv preprint arXiv:2210.00379, 2022.
- Fastnerf: High-fidelity neural rendering at 200fps. In ICCV, 2021.
- Unsupervised monocular depth estimation with left-right consistency. In Proceedings of the IEEE conference on computer vision and pattern recognition, 2017.
- Image-based 3d object reconstruction: State-of-the-art and trends in the deep learning era. PAMI, 2019.
- Multiple view geometry in computer vision. Cambridge university press, 2003.
- 3d gaussian splatting for real-time radiance field rendering. TOG, 2023a.
- 3d gaussian splatting for real-time radiance field rendering. ACM Transactions on Graphics, 2023b.
- Infonerf: Ray entropy minimization for few-shot neural volume rendering. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022.
- Point-based neural rendering with per-view optimization. In Computer Graphics Forum, 2021.
- Neural sparse voxel fields. NIPS, 2020.
- Object scene flow for autonomous vehicles. In Proceedings of the IEEE conference on computer vision and pattern recognition, 2015.
- Single image depth estimation: An overview. Digital Signal Processing, 2022.
- Local light field fusion: Practical view synthesis with prescriptive sampling guidelines. ACM Transactions on Graphics (TOG), 2019.
- Nerf: Representing scenes as neural radiance fields for view synthesis. Communications of the ACM, 2021.
- Real-time neural radiance caching for path tracing. arXiv preprint arXiv:2106.12372, 2021.
- Instant neural graphics primitives with a multiresolution hash encoding. TOG, 2022.
- Donerf: Towards real-time rendering of compact neural radiance fields using depth oracle networks. In Computer Graphics Forum, 2021.
- Regnerf: Regularizing neural radiance fields for view synthesis from sparse inputs. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022.
- Computational geometry: an introduction. Springer Science & Business Media, 2012.
- Diner: Depth-aware image-based neural radiance fields. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023.
- Michael Reed. Methods of modern mathematical physics: Functional analysis. Elsevier, 2012.
- Kilonerf: Speeding up neural radiance fields with thousands of tiny mlps. In ICCV, 2021.
- Dense depth priors for neural radiance fields from sparse input views. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022.
- Structure-from-motion revisited. In CVPR, 2016.
- Indoor segmentation and support inference from rgbd images. In Computer Vision–ECCV 2012: 12th European Conference on Computer Vision, Florence, Italy, October 7-13, 2012, Proceedings, Part V 12, 2012.
- Direct voxel grid optimization: Super-fast convergence for radiance fields reconstruction. In CVPR, 2022.
- Advances in neural rendering. In Computer Graphics Forum, 2022.
- Shape and motion from image streams under orthography: a factorization method. International journal of computer vision, 1992.
- Shimon Ullman. The interpretation of structure from motion. Proceedings of the Royal Society of London. Series B. Biological Sciences, 1979.
- Neus: Learning neural implicit surfaces by volume rendering for multi-view reconstruction. arXiv preprint arXiv:2106.10689, 2021a.
- Nerf–: Neural radiance fields without known camera parameters. arXiv preprint arXiv:2102.07064, 2021b.
- Nerfingmvs: Guided optimization of neural radiance fields for indoor multi-view stereo. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021.
- Synsin: End-to-end view synthesis from a single image. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.
- Neural fields in visual computing and beyond. In Computer Graphics Forum, 2022.
- Point-nerf: Point-based neural radiance fields. In CVPR, 2022.
- Nerfvs: Neural radiance fields for free view synthesis via geometry scaffolds. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023.
- Volume rendering of neural implicit surfaces. Advances in Neural Information Processing Systems, 2021.
- Differentiable surface splatting for point-based geometry processing. ACM Transactions on Graphics (TOG), 2019.
- Plenoctrees for real-time rendering of neural radiance fields. In ICCV, 2021.
- Nerf++: Analyzing and improving neural radiance fields. arXiv preprint arXiv:2010.07492, 2020.
- Unleashing text-to-image diffusion models for visual perception. arXiv preprint arXiv:2303.02153, 2023.
- Unsupervised learning of stereo matching. In Proceedings of the IEEE International Conference on Computer Vision, 2017.