Improving Robustness for Joint Optimization of Camera Poses and Decomposed Low-Rank Tensorial Radiance Fields (2402.13252v1)
Abstract: In this paper, we propose an algorithm that allows joint refinement of camera poses and scene geometry represented by a decomposed low-rank tensor, using only 2D images as supervision. First, we conduct a pilot study based on a 1D signal and relate our findings to 3D scenarios, where naive joint pose optimization on voxel-based NeRFs can easily lead to sub-optimal solutions. Moreover, based on an analysis of the frequency spectrum, we propose to apply convolutional Gaussian filters on 2D and 3D radiance fields for a coarse-to-fine training schedule that enables joint camera pose optimization. Leveraging the decomposition property of the decomposed low-rank tensor, our method achieves an effect equivalent to brute-force 3D convolution while incurring only a small computational overhead. To further improve the robustness and stability of joint optimization, we also propose smoothed 2D supervision, randomly scaled kernel parameters, and an edge-guided loss mask. Extensive quantitative and qualitative evaluations demonstrate that our proposed framework achieves superior performance in novel view synthesis as well as rapid convergence in optimization.
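The claim that filtering the decomposed factors matches brute-force 3D convolution can be illustrated with a small numerical sketch. It is not the paper's implementation. It assumes a pure vector (CP-style) rank-R decomposition, a separable Gaussian kernel, and zero padding at the borders; the paper's own decomposition and boundary handling may differ. All names (`gaussian_kernel_1d`, `conv3d_dense`, `rank`, `size`) are hypothetical and chosen for illustration only.

```python
import numpy as np

def gaussian_kernel_1d(sigma, radius):
    """Normalized 1D Gaussian kernel with 2 * radius + 1 taps."""
    x = np.arange(-radius, radius + 1)
    k = np.exp(-0.5 * (x / sigma) ** 2)
    return k / k.sum()

def conv3d_dense(volume, kernel3d):
    """Brute-force 'same' 3D convolution with zero padding (cubic, odd-sized kernel)."""
    n = kernel3d.shape[0]
    r = n // 2
    padded = np.pad(volume, r)
    out = np.zeros_like(volume)
    X, Y, Z = volume.shape
    for i in range(n):
        for j in range(n):
            for k in range(n):
                # Flip the kernel indices so this is a true convolution.
                w = kernel3d[n - 1 - i, n - 1 - j, n - 1 - k]
                out += w * padded[i:i + X, j:j + Y, k:k + Z]
    return out

rng = np.random.default_rng(0)
rank, size = 4, 16  # assumed toy sizes, not taken from the paper

# Rank-R factors: the dense volume is sum_r vx[r] (outer) vy[r] (outer) vz[r].
vx = rng.normal(size=(rank, size))
vy = rng.normal(size=(rank, size))
vz = rng.normal(size=(rank, size))
volume = np.einsum("ri,rj,rk->ijk", vx, vy, vz)

k1 = gaussian_kernel_1d(sigma=1.5, radius=3)
k3 = np.einsum("i,j,k->ijk", k1, k1, k1)  # separable 3D Gaussian

# Path A: brute-force 3D convolution on the dense volume.
dense_result = conv3d_dense(volume, k3)

# Path B: 1D convolution on each factor, then recompose.
fx = np.stack([np.convolve(v, k1, mode="same") for v in vx])
fy = np.stack([np.convolve(v, k1, mode="same") for v in vy])
fz = np.stack([np.convolve(v, k1, mode="same") for v in vz])
factored_result = np.einsum("ri,rj,rk->ijk", fx, fy, fz)

max_abs_diff = np.abs(dense_result - factored_result).max()
print("max abs difference:", max_abs_diff)
```

Under these assumptions the two paths agree up to floating-point error, because convolution is linear and a separable kernel acts independently along each axis of each rank-one term. In this sketch the factored path filters 3 * rank vectors of length `size`, while the dense path touches every one of the `size**3` voxels for every kernel tap, which is the source of the small overhead.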