Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency (2302.10307v1)

Published 31 Jan 2023 in cs.CV, cs.AI, and cs.CL

Abstract: Recently, great success has been made in learning visual representations from text supervision, facilitating the emergence of text-supervised semantic segmentation. However, existing works focus on pixel grouping and cross-modal semantic alignment, while ignoring the correspondence among multiple augmented views of the same image. To overcome such limitation, we propose multi-\textbf{View} \textbf{Co}nsistent learning (ViewCo) for text-supervised semantic segmentation. Specifically, we first propose text-to-views consistency modeling to learn correspondence for multiple views of the same input image. Additionally, we propose cross-view segmentation consistency modeling to address the ambiguity issue of text supervision by contrasting the segment features of Siamese visual encoders. The text-to-views consistency benefits the dense assignment of the visual features by encouraging different crops to align with the same text, while the cross-view segmentation consistency modeling provides additional self-supervision, overcoming the limitation of ambiguous text supervision for segmentation masks. Trained with large-scale image-text data, our model can directly segment objects of arbitrary categories in a zero-shot manner. Extensive experiments show that ViewCo outperforms state-of-the-art methods on average by up to 2.9\%, 1.6\%, and 2.4\% mIoU on PASCAL VOC2012, PASCAL Context, and COCO, respectively.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Pengzhen Ren (15 papers)
  2. Changlin Li (28 papers)
  3. Hang Xu (205 papers)
  4. Yi Zhu (233 papers)
  5. Guangrun Wang (43 papers)
  6. Jianzhuang Liu (91 papers)
  7. Xiaojun Chang (148 papers)
  8. Xiaodan Liang (318 papers)
Citations (39)

Summary

We haven't generated a summary for this paper yet.