
Joint-task Self-supervised Learning for Temporal Correspondence (1909.11895v1)

Published 26 Sep 2019 in cs.CV

Abstract: This paper proposes to learn reliable dense correspondence from videos in a self-supervised manner. Our learning process integrates two highly related tasks: tracking large image regions and establishing fine-grained pixel-level associations between consecutive video frames. We exploit the synergy between both tasks through a shared inter-frame affinity matrix, which simultaneously models transitions between video frames at both the region and pixel levels. Region-level localization helps reduce ambiguities in fine-grained matching by narrowing down search regions, while fine-grained matching provides bottom-up features to facilitate region-level localization. Our method outperforms state-of-the-art self-supervised methods on a variety of visual correspondence tasks, including video-object and part-segmentation propagation, keypoint tracking, and object tracking. Our self-supervised method even surpasses the fully-supervised affinity feature representation obtained from a ResNet-18 pre-trained on ImageNet.
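
To make the idea of an inter-frame affinity matrix concrete, here is a minimal NumPy sketch. It is not the authors' implementation: the function names `affinity_matrix` and `propagate`, the `temperature` parameter, and the choice of a column-wise softmax over feature dot products are all assumptions made for illustration. The sketch only shows how one matrix can carry any per-pixel quantity (segmentation labels, coordinates, colors) from one frame to the next.

```python
import numpy as np


def affinity_matrix(feat_a, feat_b, temperature=1.0):
    """Soft correspondence between two frames (hypothetical helper).

    feat_a: (C, Na) features of frame A, one column per pixel.
    feat_b: (C, Nb) features of frame B, one column per pixel.
    Returns A of shape (Na, Nb); column j is a distribution over
    the pixels of frame A that pixel j of frame B may come from.
    """
    logits = feat_a.T @ feat_b / temperature
    # Subtract the column-wise max for numerical stability.
    logits = logits - logits.max(axis=0, keepdims=True)
    weights = np.exp(logits)
    return weights / weights.sum(axis=0, keepdims=True)


def propagate(values_a, affinity):
    """Carry per-pixel values from frame A to frame B.

    values_a: (K, Na), e.g. one-hot labels or (x, y) coordinates.
    Returns (K, Nb): each output column is an affinity-weighted
    average of the columns of values_a.
    """
    return values_a @ affinity


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feat_a = rng.normal(size=(8, 6))   # 8-dim features, 6 pixels
    feat_b = rng.normal(size=(8, 6))
    A = affinity_matrix(feat_a, feat_b, temperature=0.5)

    # One-hot labels for two segments in frame A.
    labels_a = np.array([[1, 1, 1, 0, 0, 0],
                         [0, 0, 0, 1, 1, 1]], dtype=float)
    labels_b = propagate(labels_a, A)
    print(labels_b.argmax(axis=0))  # predicted segment per pixel in B
```

Passing pixel coordinates instead of labels to `propagate` gives, for each pixel of frame B, an estimated source location in frame A, which is one way the same matrix could serve both a coarse localization task and fine-grained matching.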

Authors (6)
  1. Xueting Li (32 papers)
  2. Sifei Liu (64 papers)
  3. Shalini De Mello (45 papers)
  4. Xiaolong Wang (243 papers)
  5. Jan Kautz (215 papers)
  6. Ming-Hsuan Yang (377 papers)
Citations (133)
