Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Towards Autonomous Driving: a Multi-Modal 360$^{\circ}$ Perception Proposal (2008.09672v1)

Published 21 Aug 2020 in cs.CV and cs.RO

Abstract: In this paper, a multi-modal 360${\circ}$ framework for 3D object detection and tracking for autonomous vehicles is presented. The process is divided into four main stages. First, images are fed into a CNN network to obtain instance segmentation of the surrounding road participants. Second, LiDAR-to-image association is performed for the estimated mask proposals. Then, the isolated points of every object are processed by a PointNet ensemble to compute their corresponding 3D bounding boxes and poses. Lastly, a tracking stage based on Unscented Kalman Filter is used to track the agents along time. The solution, based on a novel sensor fusion configuration, provides accurate and reliable road environment detection. A wide variety of tests of the system, deployed in an autonomous vehicle, have successfully assessed the suitability of the proposed perception stack in a real autonomous driving application.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (10)
  1. Jorge Beltrán (8 papers)
  2. Carlos Guindel (6 papers)
  3. Irene Cortés (3 papers)
  4. Alejandro Barrera (4 papers)
  5. Armando Astudillo (1 paper)
  6. Jesús Urdiales (1 paper)
  7. Farid Bekka (1 paper)
  8. Vicente Milanés (2 papers)
  9. Fernando García (15 papers)
  10. Mario Álvarez (1 paper)
Citations (13)

Summary

We haven't generated a summary for this paper yet.