Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Robust Environment Perception for Automated Driving: A Unified Learning Pipeline for Visual-Infrared Object Detection (2206.03943v1)

Published 8 Jun 2022 in cs.CV, cs.IT, and math.IT

Abstract: The RGB complementary metal-oxidesemiconductor (CMOS) sensor works within the visible light spectrum. Therefore it is very sensitive to environmental light conditions. On the contrary, a long-wave infrared (LWIR) sensor operating in 8-14 micro meter spectral band, functions independent of visible light. In this paper, we exploit both visual and thermal perception units for robust object detection purposes. After delicate synchronization and (cross-) labeling of the FLIR [1] dataset, this multi-modal perception data passes through a convolutional neural network (CNN) to detect three critical objects on the road, namely pedestrians, bicycles, and cars. After evaluation of RGB and infrared (thermal and infrared are often used interchangeably) sensors separately, various network structures are compared to fuse the data at the feature level effectively. Our RGB-thermal (RGBT) fusion network, which takes advantage of a novel entropy-block attention module (EBAM), outperforms the state-of-the-art network [2] by 10% with 82.9% mAP.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Mohsen Vadidar (1 paper)
  2. Ali Kariminezhad (17 papers)
  3. Christian Mayr (35 papers)
  4. Laurent Kloeker (8 papers)
  5. Lutz Eckstein (42 papers)
Citations (7)

Summary

We haven't generated a summary for this paper yet.