Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Learning Video Object Segmentation from Unlabeled Videos (2003.05020v1)

Published 10 Mar 2020 in cs.CV

Abstract: We propose a new method for video object segmentation (VOS) that addresses object pattern learning from unlabeled videos, unlike most existing methods which rely heavily on extensive annotated data. We introduce a unified unsupervised/weakly supervised learning framework, called MuG, that comprehensively captures intrinsic properties of VOS at multiple granularities. Our approach can help advance understanding of visual patterns in VOS and significantly reduce annotation burden. With a carefully-designed architecture and strong representation learning ability, our learned model can be applied to diverse VOS settings, including object-level zero-shot VOS, instance-level zero-shot VOS, and one-shot VOS. Experiments demonstrate promising performance in these settings, as well as the potential of MuG in leveraging unlabeled data to further improve the segmentation accuracy.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Xiankai Lu (21 papers)
  2. Wenguan Wang (103 papers)
  3. Jianbing Shen (96 papers)
  4. Yu-Wing Tai (123 papers)
  5. David Crandall (54 papers)
  6. Steven C. H. Hoi (94 papers)
Citations (138)

Summary

We haven't generated a summary for this paper yet.