
Disentangling and Vectorization: A 3D Visual Perception Approach for Autonomous Driving Based on Surround-View Fisheye Cameras (2107.08862v1)

Published 19 Jul 2021 in cs.CV

Abstract: 3D visual perception for vehicles with a surround-view fisheye camera system is a critical and challenging task for low-cost urban autonomous driving. Existing monocular 3D object detection methods do not perform well enough on fisheye images for mass production, partly due to the lack of 3D datasets of such images. In this paper, we avoid the difficulty of acquiring large-scale, accurately labeled 3D ground-truth data by breaking the 3D object detection task down into sub-tasks such as vehicle contact point detection, type classification, re-identification, and unit assembling. In particular, we propose the concept of a Multidimensional Vector to include the usable information generated in different dimensions and stages, instead of the descriptive approach of a bird's eye view (BEV) or a cube of eight points. Experiments on real fisheye images demonstrate that our solution achieves state-of-the-art accuracy while running in real time in practice.

Authors (8)
  1. Zizhang Wu (22 papers)
  2. Wenkai Zhang (15 papers)
  3. Jizheng Wang (2 papers)
  4. Man Wang (14 papers)
  5. Yuanzhu Gan (9 papers)
  6. Xinchao Gou (1 paper)
  7. Muqing Fang (3 papers)
  8. Jing Song (36 papers)
Citations (5)
