Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Self-supervised Object Motion and Depth Estimation from Video (1912.04250v2)

Published 9 Dec 2019 in cs.CV

Abstract: We present a self-supervised learning framework to estimate the individual object motion and monocular depth from video. We model the object motion as a 6 degree-of-freedom rigid-body transformation. The instance segmentation mask is leveraged to introduce the information of object. Compared with methods which predict dense optical flow map to model the motion, our approach significantly reduces the number of values to be estimated. Our system eliminates the scale ambiguity of motion prediction through imposing a novel geometric constraint loss term. Experiments on KITTI driving dataset demonstrate our system is capable to capture the object motion without external annotation. Our system outperforms previous self-supervised approaches in terms of 3D scene flow prediction, and contribute to the disparity prediction in dynamic area.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Qi Dai (58 papers)
  2. Vaishakh Patil (9 papers)
  3. Simon Hecker (8 papers)
  4. Dengxin Dai (99 papers)
  5. Luc Van Gool (570 papers)
  6. Konrad Schindler (132 papers)
Citations (40)

Summary

We haven't generated a summary for this paper yet.