Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

LongShortNet: Exploring Temporal and Semantic Features Fusion in Streaming Perception (2210.15518v4)

Published 27 Oct 2022 in cs.CV, cs.AI, and cs.MM

Abstract: Streaming perception is a critical task in autonomous driving that requires balancing the latency and accuracy of the autopilot system. However, current methods for streaming perception are limited as they only rely on the current and adjacent two frames to learn movement patterns. This restricts their ability to model complex scenes, often resulting in poor detection results. To address this limitation, we propose LongShortNet, a novel dual-path network that captures long-term temporal motion and integrates it with short-term spatial semantics for real-time perception. LongShortNet is notable as it is the first work to extend long-term temporal modeling to streaming perception, enabling spatiotemporal feature fusion. We evaluate LongShortNet on the challenging Argoverse-HD dataset and demonstrate that it outperforms existing state-of-the-art methods with almost no additional computational cost.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Chenyang Li (71 papers)
  2. Zhi-Qi Cheng (61 papers)
  3. Jun-Yan He (27 papers)
  4. Pengyu Li (19 papers)
  5. Bin Luo (209 papers)
  6. Hanyuan Chen (6 papers)
  7. Yifeng Geng (30 papers)
  8. Jin-Peng Lan (7 papers)
  9. Xuansong Xie (69 papers)
Citations (10)