High-Fidelity SLAM Using Gaussian Splatting with Rendering-Guided Densification and Regularized Optimization (2403.12535v2)
Abstract: We propose a dense RGBD SLAM system based on 3D Gaussian Splatting that provides metrically accurate pose tracking and visually realistic reconstruction. To this end, we first propose a Gaussian densification strategy based on the rendering loss to map unobserved areas and refine reobserved areas. Second, we introduce extra regularization parameters to alleviate the forgetting problem in the continuous mapping problem, where parameters tend to overfit the latest frame and result in decreasing rendering quality for previous frames. Both mapping and tracking are performed with Gaussian parameters by minimizing re-rendering loss in a differentiable way. Compared to recent neural and concurrently developed gaussian splatting RGBD SLAM baselines, our method achieves state-of-the-art results on the synthetic dataset Replica and competitive results on the real-world dataset TUM.
- Raul Mur-Artal, Jose Maria Martinez Montiel and Juan D Tardos “ORB-SLAM: a versatile and accurate monocular SLAM system” In IEEE transactions on robotics 31.5 IEEE, 2015, pp. 1147–1163
- “Kinectfusion: Real-time dense surface mapping and tracking” In 2011 10th IEEE international symposium on mixed and augmented reality, 2011, pp. 127–136 Ieee
- “inerf: Inverting neural radiance fields for pose estimation” In 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021, pp. 1323–1330 IEEE
- “Nice-slam: Neural implicit scalable encoding for slam” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 12786–12796
- Mohammad Mahdi Johari, Camilla Carta and François Fleuret “Eslam: Efficient dense slam system based on hybrid representation of signed distance fields” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 17408–17419
- Hengyi Wang, Jingwen Wang and Lourdes Agapito “Co-SLAM: Joint Coordinate and Sparse Parametric Encodings for Neural Real-Time SLAM” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 13293–13302
- “iMAP: Implicit mapping and positioning in real-time” In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 6229–6238
- “Point-slam: Dense neural point cloud-based slam” In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023, pp. 18433–18444
- “3d gaussian splatting for real-time radiance field rendering” In ACM Transactions on Graphics (ToG) 42.4 ACM New York, NY, USA, 2023, pp. 1–14
- Johannes L Schonberger and Jan-Michael Frahm “Structure-from-motion revisited” In Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 4104–4113
- “Splatam: Splat, track & map 3d gaussians for dense rgb-d slam” In arXiv preprint arXiv:2312.02126, 2023
- “Gs-slam: Dense visual slam with 3d gaussian splatting” In arXiv preprint arXiv:2311.11700, 2023
- Jakob Engel, Vladlen Koltun and Daniel Cremers “Direct sparse odometry” In IEEE transactions on pattern analysis and machine intelligence 40.3 IEEE, 2017, pp. 611–625
- “Nerf: Representing scenes as neural radiance fields for view synthesis” In Communications of the ACM 65.1 ACM New York, NY, USA, 2021, pp. 99–106
- “Gaussian splatting slam” In arXiv preprint arXiv:2312.06741, 2023
- “Gaussian-slam: Photo-realistic dense slam with gaussian splatting” In arXiv preprint arXiv:2312.10070, 2023
- “Photo-slam: Real-time simultaneous localization and photorealistic mapping for monocular, stereo, and rgb-d cameras” In arXiv preprint arXiv:2311.16728, 2023
- “Instant neural graphics primitives with a multiresolution hash encoding” In ACM Transactions on Graphics (ToG) 41.4 ACM New York, NY, USA, 2022, pp. 1–15
- “Zip-NeRF: Anti-aliased grid-based neural radiance fields” In arXiv preprint arXiv:2304.06706, 2023
- “Plenoxels: Radiance fields without neural networks” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 5501–5510
- Michael McCloskey and Neal J Cohen “Catastrophic interference in connectionist networks: The sequential learning problem” In Psychology of learning and motivation 24 Elsevier, 1989, pp. 109–165
- “Self6d: Self-supervised monocular 6d object pose estimation” In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part I 16, 2020, pp. 108–125 Springer
- “A volumetric method for building complex models from range images” In Proceedings of the 23rd annual conference on Computer graphics and interactive techniques, 1996, pp. 303–312
- “The Replica dataset: A digital replica of indoor spaces” In arXiv preprint arXiv:1906.05797, 2019
- “A benchmark for the evaluation of RGB-D SLAM systems” In 2012 IEEE/RSJ international conference on intelligent robots and systems, 2012, pp. 573–580 IEEE
- “Barf: Bundle-adjusting neural radiance fields” In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 5741–5751
- “Loopy-SLAM: Dense Neural SLAM with Loop Closures” In arXiv preprint arXiv:2402.09944, 2024
- “Lerf: Language embedded radiance fields” In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023, pp. 19729–19739
- “LangSplat: 3D Language Gaussian Splatting” In arXiv preprint arXiv:2312.16084, 2023