- The paper introduces a dense RGB-D SLAM system that utilizes a consistent, uncertainty-aware 3D Gaussian field for enhanced mapping.
- It decouples camera pose derivatives and employs Gaussian splatting to achieve robust tracking and improved surface modeling.
- Experimental results demonstrate real-time performance up to 15 Hz with superior reconstruction quality across multiple datasets.
CG-SLAM: Enhancing Dense RGB-D SLAM with a 3D Gaussian Field
Introduction
Dense RGB-D SLAM technology has achieved notable successes in simultaneous localization and mapping (SLAM) by leveraging neural radiance fields (NeRF) for 3D representations. However, NeRF-based methods suffer from computational inefficiency, hindering their broader application in real-time systems. To address this, CG-SLAM introduces an efficient SLAM system based on a novel consistent and uncertainty-aware 3D Gaussian field. This approach leverages Gaussian Splatting techniques for surface modeling, achieving superior tracking and mapping performance with enhanced speed up to 15 Hz, a significant improvement over conventional methods.
Approach
CG-SLAM employs several innovative techniques to construct a stable and consistent 3D Gaussian field, suitable for accurate tracking and efficient mapping:
- Derivatives of Camera Poses in 3D Gaussian Splatting: A comprehensive mathematical analysis ensures that tracking and mapping processes are decoupled effectively, boosting the overall system's performance.
- Consistent and Stable 3D Gaussian Field: Through scale regularization and alignment of median depth with alpha-blending depth, the system ensures Gaussian primitives are well distributed over scene surfaces, improving geometric stability.
- Depth Uncertainty Model: A novel model improves system accuracy by focusing on stable and valuable Gaussian primitives during optimization, effectively filtering out outliers.
Experimental Evaluation
CG-SLAM was extensively tested across various datasets, including Replica, TUM, and ScanNet, demonstrating superior performance in tracking accuracy, reconstruction quality, and runtime efficiency. Compared to state-of-the-art NeRF-SLAM methods and concurrent Gaussian-based SLAM systems, CG-SLAM exhibits competitive or better results, highlighting its effectiveness in practical applications.
Implications and Future Work
The introduction of CG-SLAM opens new possibilities for efficient and accurate dense SLAM applications. Its superior performance could significantly benefit virtual/augmented reality, autonomous driving, and robotic navigation. Future developments could explore more compact representations of the 3D Gaussian field to reduce memory usage and improve system's predictability in unobserved areas. Additionally, extending the system to handle dynamic environments presents an interesting direction for further research.
Conclusion
CG-SLAM represents a significant advancement in dense RGB-D SLAM by integrating a consistent and efficient 3D Gaussian field. Its capability to achieve high tracking accuracy, detailed reconstruction, and real-time performance positions it as a powerful tool for various 3D computer vision applications. With room for further improvements and adaptations, CG-SLAM sets a new benchmark for future research in the field of SLAM technology.