Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

LiDAR-Based 3D Object Detection via Hybrid 2D Semantic Scene Generation (2304.01519v1)

Published 4 Apr 2023 in cs.CV

Abstract: Bird's-Eye View (BEV) features are popular intermediate scene representations shared by the 3D backbone and the detector head in LiDAR-based object detectors. However, little research has been done to investigate how to incorporate additional supervision on the BEV features to improve proposal generation in the detector head, while still balancing the number of powerful 3D layers and efficient 2D network operations. This paper proposes a novel scene representation that encodes both the semantics and geometry of the 3D environment in 2D, which serves as a dense supervision signal for better BEV feature learning. The key idea is to use auxiliary networks to predict a combination of explicit and implicit semantic probabilities by exploiting their complementary properties. Extensive experiments show that our simple yet effective design can be easily integrated into most state-of-the-art 3D object detectors and consistently improves upon baseline models.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Haitao Yang (62 papers)
  2. Zaiwei Zhang (16 papers)
  3. Xiangru Huang (8 papers)
  4. Min Bai (14 papers)
  5. Chen Song (21 papers)
  6. Bo Sun (100 papers)
  7. Li Erran Li (37 papers)
  8. Qixing Huang (78 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.