Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction (2402.03762v5)

Published 6 Feb 2024 in cs.CV and cs.RO

Abstract: Monocular SLAM has received a lot of attention due to its simple RGB inputs and the lifting of complex sensor constraints. However, existing monocular SLAM systems are designed for bounded scenes, restricting the applicability of SLAM systems. To address this limitation, we propose MoD-SLAM, the first monocular NeRF-based dense mapping method that allows 3D reconstruction in real-time in unbounded scenes. Specifically, we introduce a Gaussian-based unbounded scene representation approach to solve the challenge of mapping scenes without boundaries. This strategy is essential to extend the SLAM application. Moreover, a depth estimation module in the front-end is designed to extract accurate priori depth values to supervise mapping and tracking processes. By introducing a robust depth loss term into the tracking process, our SLAM system achieves more precise pose estimation in large-scale scenes. Our experiments on two standard datasets show that MoD-SLAM achieves competitive performance, improving the accuracy of the 3D reconstruction and localization by up to 30% and 15% respectively compared with existing state-of-the-art monocular SLAM systems.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Heng Zhou (47 papers)
  2. Zhetao Guo (1 paper)
  3. Shuhong Liu (13 papers)
  4. Lechen Zhang (9 papers)
  5. Qihao Wang (4 papers)
  6. Yuxiang Ren (24 papers)
  7. Mingrui Li (14 papers)
Citations (9)

Summary

We haven't generated a summary for this paper yet.