
DAMO-StreamNet: Optimizing Streaming Perception in Autonomous Driving (2303.17144v3)

Published 30 Mar 2023 in cs.CV, cs.AI, cs.MM, and cs.RO

Abstract: Real-time perception, or streaming perception, is a crucial aspect of autonomous driving that has yet to be thoroughly explored in existing research. To address this gap, we present DAMO-StreamNet, an optimized framework that combines recent advances from the YOLO series with a comprehensive analysis of spatial and temporal perception mechanisms, delivering a cutting-edge solution. The key innovations of DAMO-StreamNet are: (1) a robust neck structure incorporating deformable convolution, enhancing the receptive field and feature alignment capabilities; (2) a dual-branch structure that integrates short-path semantic features and long-path temporal features, improving motion state prediction accuracy; (3) logits-level distillation for efficient optimization, aligning the logits of teacher and student networks in semantic space; and (4) a real-time forecasting mechanism that updates support frame features with the current frame, ensuring seamless streaming perception during inference. Our experiments demonstrate that DAMO-StreamNet surpasses existing state-of-the-art methods, achieving 37.8% sAP at normal input size (600, 960) and 43.3% sAP at large input size (1200, 1920) without using extra data. This work not only sets a new benchmark for real-time perception but also provides valuable insights for future research. Additionally, DAMO-StreamNet can be applied to various autonomous systems, such as drones and robots, paving the way for real-time perception. The code is at https://github.com/zhiqic/DAMO-StreamNet.
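
The abstract's fourth point, updating support frame features with the current frame during inference, can be illustrated with a small feature cache. The following is a minimal sketch under stated assumptions, not the authors' implementation: the class name SupportFeatureBuffer, the extract and fuse callables, and the num_support parameter are all hypothetical names introduced here, and features are plain lists of floats rather than tensors. The idea shown is only that each frame is encoded once, fused with cached features from earlier frames, and then stored to serve as a support frame for later steps.

```python
from collections import deque
from typing import Callable, Deque, List, Sequence

Feature = List[float]


class SupportFeatureBuffer:
    """Hypothetical cache of recent frame features for streaming inference.

    Each incoming frame is encoded exactly once. Its features are fused with
    the cached features of earlier (support) frames, then pushed into the
    cache so the next step can reuse them without re-encoding.
    """

    def __init__(self, num_support: int) -> None:
        if num_support < 1:
            raise ValueError("num_support must be at least 1")
        # deque(maxlen=...) silently drops the oldest entry when full.
        self._features: Deque[Feature] = deque(maxlen=num_support)

    def step(
        self,
        frame: Sequence[float],
        extract: Callable[[Sequence[float]], Feature],
        fuse: Callable[[Feature, List[Feature]], Feature],
    ) -> Feature:
        current = extract(frame)
        # Assumption: on the very first frame there is no history, so the
        # current features stand in as their own support.
        support = list(self._features) or [current]
        output = fuse(current, support)
        # The current frame becomes a support frame for later steps.
        self._features.append(current)
        return output


def toy_extract(frame: Sequence[float]) -> Feature:
    """Stand-in for a backbone: sums the 'pixels' into one feature."""
    return [float(sum(frame))]


def toy_fuse(current: Feature, support: List[Feature]) -> Feature:
    """Stand-in for temporal fusion: difference from the mean support feature."""
    mean_support = sum(s[0] for s in support) / len(support)
    return [current[0] - mean_support]


if __name__ == "__main__":
    buffer = SupportFeatureBuffer(num_support=2)
    for frame in ([1, 2], [2, 3], [4, 4]):
        print(buffer.step(frame, toy_extract, toy_fuse))
```

In a real detector, extract would be the backbone and neck and fuse would be the temporal fusion module; the toy functions above only make the data flow visible.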

Authors (8)
  1. Jun-Yan He (27 papers)
  2. Zhi-Qi Cheng (61 papers)
  3. Chenyang Li (71 papers)
  4. Wangmeng Xiang (19 papers)
  5. Binghui Chen (19 papers)
  6. Bin Luo (209 papers)
  7. Yifeng Geng (30 papers)
  8. Xuansong Xie (69 papers)
Citations (16)
