Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction (2407.07587v3)

Published 10 Jul 2024 in cs.CV

Abstract: Accurate perception of the dynamic environment is a fundamental task for autonomous driving and robot systems. This paper introduces Let Occ Flow, the first self-supervised work for joint 3D occupancy and occupancy flow prediction using only camera inputs, eliminating the need for 3D annotations. Utilizing TPV for unified scene representation and deformable attention layers for feature aggregation, our approach incorporates a novel attention-based temporal fusion module to capture dynamic object dependencies, followed by a 3D refine module for fine-gained volumetric representation. Besides, our method extends differentiable rendering to 3D volumetric flow fields, leveraging zero-shot 2D segmentation and optical flow cues for dynamic decomposition and motion optimization. Extensive experiments on nuScenes and KITTI datasets demonstrate the competitive performance of our approach over prior state-of-the-art methods. Our project page is available at https://eliliu2233.github.io/letoccflow/

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Yili Liu (5 papers)
  2. Linzhan Mou (8 papers)
  3. Xuan Yu (13 papers)
  4. Chenrui Han (3 papers)
  5. Sitong Mao (11 papers)
  6. Rong Xiong (115 papers)
  7. Yue Wang (676 papers)
Citations (4)

Summary

We haven't generated a summary for this paper yet.