
High-Fidelity SLAM Using Gaussian Splatting with Rendering-Guided Densification and Regularized Optimization (2403.12535v2)

Published 19 Mar 2024 in cs.RO and cs.CV

Abstract: We propose a dense RGBD SLAM system based on 3D Gaussian Splatting that provides metrically accurate pose tracking and visually realistic reconstruction. To this end, we first propose a Gaussian densification strategy based on the rendering loss to map unobserved areas and refine reobserved areas. Second, we introduce extra regularization parameters to alleviate forgetting during continuous mapping, where parameters tend to overfit the latest frame and rendering quality for previous frames degrades as a result. Both mapping and tracking are performed with Gaussian parameters by minimizing the re-rendering loss in a differentiable way. Compared to recent neural and concurrently developed Gaussian Splatting RGBD SLAM baselines, our method achieves state-of-the-art results on the synthetic dataset Replica and competitive results on the real-world dataset TUM.
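
The abstract does not spell out how the rendering loss drives densification, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes a per-pixel L1 error between the rendered and observed images, a fixed threshold, and a pinhole camera model. The names `select_densify_pixels`, `back_project`, `loss_threshold`, and the intrinsics `fx`, `fy`, `cx`, `cy` are all hypothetical. Pixels that render poorly and have valid depth are back-projected to 3D points that could seed new Gaussians.

```python
from typing import List, Sequence, Tuple

Pixel = Tuple[int, int]                 # (row, col)
Point3D = Tuple[float, float, float]    # camera-frame (x, y, z)
Image = Sequence[Sequence[float]]       # single-channel image as nested rows


def select_densify_pixels(rendered: Image, observed: Image, depth: Image,
                          loss_threshold: float) -> List[Pixel]:
    """Pick pixels whose per-pixel L1 rendering error exceeds the threshold.

    Assumption: pixels with non-positive depth carry no measurement and are skipped.
    """
    selected = []
    for v, (row_r, row_o, row_d) in enumerate(zip(rendered, observed, depth)):
        for u, (r, o, d) in enumerate(zip(row_r, row_o, row_d)):
            if d > 0.0 and abs(r - o) > loss_threshold:
                selected.append((v, u))
    return selected


def back_project(pixels: Sequence[Pixel], depth: Image,
                 fx: float, fy: float, cx: float, cy: float) -> List[Point3D]:
    """Lift selected pixels to camera-frame 3D points with a pinhole model."""
    points = []
    for v, u in pixels:
        z = depth[v][u]
        points.append(((u - cx) * z / fx, (v - cy) * z / fy, z))
    return points


# Toy 2x2 example: one badly rendered pixel with valid depth,
# and one badly rendered pixel without depth (ignored).
rendered = [[0.5, 0.0], [0.5, 0.5]]
observed = [[0.5, 0.9], [0.5, 0.1]]
depth = [[1.0, 2.0], [1.0, 0.0]]

pixels = select_densify_pixels(rendered, observed, depth, loss_threshold=0.2)
new_centers = back_project(pixels, depth, fx=1.0, fy=1.0, cx=0.0, cy=0.0)
print(pixels)        # [(0, 1)]
print(new_centers)   # [(2.0, 0.0, 2.0)]
```

In a full system the resulting points would still have to be transformed into the world frame with the tracked camera pose and given initial scale, opacity, and color; those steps are omitted here because the abstract gives no detail on them.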
