MotionGS : Compact Gaussian Splatting SLAM by Motion Filter (2405.11129v2)
Abstract: With their high-fidelity scene representation capability, the attention of SLAM field is deeply attracted by the Neural Radiation Field (NeRF) and 3D Gaussian Splatting (3DGS). Recently, there has been a surge in NeRF-based SLAM, while 3DGS-based SLAM is sparse. A novel 3DGS-based SLAM approach with a fusion of deep visual feature, dual keyframe selection and 3DGS is presented in this paper. Compared with the existing methods, the proposed tracking is achieved by feature extraction and motion filter on each frame. The joint optimization of poses and 3D Gaussians runs through the entire mapping process. Additionally, the coarse-to-fine pose estimation and compact Gaussian scene representation are implemented by dual keyframe selection and novel loss functions. Experimental results demonstrate that the proposed algorithm not only outperforms the existing methods in tracking and mapping, but also has less memory usage.
- Simultaneous localization and mapping: A survey of current trends in autonomous driving. IEEE Transactions on Intelligent Vehicles, 2:194–220, 2017.
- Mam3slam: Towards underwater-robust multi-agent visual slam. Ocean Engineering, 302, 2024.
- A slam-based 6dof controller with smooth auto-calibration for virtual reality. The Visual Computer, 39:1–14, 06 2022.
- Visual slam algorithms and their application for ar, mapping, localization and wayfinding. Array, 15:100–222, 2022.
- Orb-slam3: An accurate open-source library for visual, visual–inertial, and multimap slam. IEEE Transactions on Robotics, 37(6):1874–1890, 2021.
- Orb-slam2: An open-source slam system for monocular, stereo, and rgb-d cameras. IEEE Transactions on Robotics, 33(5):1255–1262, 2017.
- Real-time 3d reconstruction in dynamic scenes using point-based fusion. In 2013 International Conference on 3D Vision - 3DV 2013, pages 1–8, 2013.
- Real-time scalable dense surfel mapping. In 2019 International Conference on Robotics and Automation (ICRA), pages 6919–6925, 2019.
- Elasticfusion: Real-time dense slam and light source estimation. The International Journal of Robotics Research, 35(14):1697–1716, 2016.
- Ovpc mesh: 3d free-space representation for local ground vehicle navigation. In 2019 International Conference on Robotics and Automation (ICRA), pages 8648–8654, 2019.
- Surfelmeshing: Online surfel-based mesh reconstruction. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(10):2494–2507, 2020.
- Real-time 3d reconstruction at scale using voxel hashing. ACM Trans. Graph., 32(6), nov 2013.
- Direct voxel grid optimization: Super-fast convergence for radiance fields reconstruction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5459–5469, June 2022.
- Plenoxels: Radiance fields without neural networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5501–5510, June 2022.
- Kinectfusion: Real-time dense surface mapping and tracking. In 2011 10th IEEE International Symposium on Mixed and Augmented Reality, pages 127–136, 2011.
- Nerf: Representing scenes as neural radiance fields for view synthesis, 2020.
- imap: Implicit mapping and positioning in real-time. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 6229–6238, October 2021.
- Vox-fusion: Dense tracking and mapping with voxel-based neural implicit representation. In 2022 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), pages 499–507, 2022.
- Nice-slam: Neural implicit scalable encoding for slam. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 12786–12796, June 2022.
- Eslam: Efficient dense slam system based on hybrid representation of signed distance fields. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 17408–17419, June 2023.
- Go-slam: Global optimization for consistent 3d instant reconstruction. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 3727–3737, October 2023.
- Co-slam: Joint coordinate and sparse parametric encodings for neural real-time slam. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 13293–13302, June 2023.
- Point-slam: Dense neural point cloud-based slam. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 18433–18444, October 2023.
- Nerf-slam: Real-time dense monocular slam with neural radiance fields. In 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 3437–3444, 2023.
- Gs-slam: Dense visual slam with 3d gaussian splatting, 2024.
- 3d gaussian splatting for real-time radiance field rendering, 2023.
- Gaussian splatting slam, 2024.
- Splatam: Splat, track & map 3d gaussians for dense rgb-d slam, 2024.
- Dtam: Dense tracking and mapping in real-time. In 2011 International Conference on Computer Vision, pages 2320–2327, 2011.
- Kintinuous : Spatially extended kinectfusion. 01 2012.
- Droid-slam: Deep visual slam for monocular, stereo, and rgb-d cameras. In M. Ranzato, A. Beygelzimer, Y. Dauphin, P.S. Liang, and J. Wortman Vaughan, editors, Advances in Neural Information Processing Systems, volume 34, pages 16558–16569. Curran Associates, Inc., 2021.
- Hi-slam: Monocular real-time dense mapping with hybrid implicit fields. IEEE Robotics and Automation Letters, 9(2):1548–1555, 2024.
- Photo-slam: Real-time simultaneous localization and photorealistic mapping for monocular, stereo, and rgb-d cameras, 2024.
- Gaussian-slam: Photo-realistic dense slam with gaussian splatting, 2024.
- Rgbd gs-icp slam, 2024.
- Compact 3d gaussian representation for radiance field, 2024.
- Estimating or propagating gradients through stochastic neurons for conditional computation, 2013.
- A micro lie theory for state estimation in robotics, 2021.
- A benchmark for the evaluation of rgb-d slam systems. In 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 573–580, 2012.
- The replica dataset: A digital replica of indoor spaces, 2019.