Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Simple Baseline for Supervised Surround-view Depth Estimation (2303.07759v3)

Published 14 Mar 2023 in cs.CV

Abstract: Depth estimation has been widely studied and serves as the fundamental step of 3D perception for intelligent vehicles. Though significant progress has been made in monocular depth estimation in the past decades, these attempts are mainly conducted on the KITTI benchmark with only front-view cameras, which ignores the correlations across surround-view cameras. In this paper, we propose S3Depth, a Simple Baseline for Supervised Surround-view Depth Estimation, to jointly predict the depth maps across multiple surrounding cameras. Specifically, we employ a global-to-local feature extraction module which combines CNN with transformer layers for enriched representations. Further, the Adjacent-view Attention mechanism is proposed to enable the intra-view and inter-view feature propagation. The former is achieved by the self-attention module within each view, while the latter is realized by the adjacent attention module, which computes the attention across multi-cameras to exchange the multi-scale representations across surround-view feature maps. Extensive experiments show that our method achieves superior performance over existing state-of-the-art methods on both DDAD and nuScenes datasets.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Xianda Guo (23 papers)
  2. Wenjie Yuan (1 paper)
  3. Yunpeng Zhang (31 papers)
  4. Tian Yang (46 papers)
  5. Chenming Zhang (10 papers)
  6. Zheng Zhu (200 papers)
  7. Long Chen (396 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.