Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

PiCIE: Unsupervised Semantic Segmentation using Invariance and Equivariance in Clustering (2103.17070v1)

Published 30 Mar 2021 in cs.CV

Abstract: We present a new framework for semantic segmentation without annotations via clustering. Off-the-shelf clustering methods are limited to curated, single-label, and object-centric images yet real-world data are dominantly uncurated, multi-label, and scene-centric. We extend clustering from images to pixels and assign separate cluster membership to different instances within each image. However, solely relying on pixel-wise feature similarity fails to learn high-level semantic concepts and overfits to low-level visual cues. We propose a method to incorporate geometric consistency as an inductive bias to learn invariance and equivariance for photometric and geometric variations. With our novel learning objective, our framework can learn high-level semantic concepts. Our method, PiCIE (Pixel-level feature Clustering using Invariance and Equivariance), is the first method capable of segmenting both things and stuff categories without any hyperparameter tuning or task-specific pre-processing. Our method largely outperforms existing baselines on COCO and Cityscapes with +17.5 Acc. and +4.5 mIoU. We show that PiCIE gives a better initialization for standard supervised training. The code is available at https://github.com/janghyuncho/PiCIE.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Jang Hyun Cho (9 papers)
  2. Utkarsh Mall (12 papers)
  3. Kavita Bala (30 papers)
  4. Bharath Hariharan (82 papers)
Citations (178)

Summary

  • The paper demonstrates that applying invariance and equivariance principles in clustering enhances unsupervised semantic segmentation.
  • It introduces the novel PiCIE framework, which clusters image features for increased consistency and discrimination.
  • Experimental results show that PiCIE outperforms existing methods on critical segmentation benchmarks.

Overview of \LaTeX\ Guidelines for Author Responses

The paper "\LaTeX\ Guidelines for Author Response" offers a comprehensive set of instructions aimed at authors preparing a rebuttal document for CVPR conferences. This paper delineates the constraints and stylistic requirements for submitting a response to reviewers' comments after the initial paper review process. The overall structure of the author rebuttal is highly regimented, emphasizing conciseness and adherence to strict formatting standards.

Submission Protocol

The rebuttal process is a critical component of the paper submission cycle in academic conferences, allowing authors to address factual errors or provide clarifications on points highlighted by reviewers. Importantly, the rebuttal is constrained to a one-page PDF, thus demanding clear and succinct communication from the authors. Through this framework, authors can append supplementary information such as figures or proofs, though these must relate directly to the core content of the original submission and cannot introduce novel research contributions.

The paper underscores that no new experiments or computational results should be presented in the rebuttal. This stipulation aligns with the 2018 PAMI-TC policies, which aim to alleviate the burden on authors to produce additional experimental data during the rebuttal phase. This guideline ensures fairness and consistency in the review process, preventing any penalization of authors who choose not to present new experimental findings in their rebuttals.

Formatting Standards

Formatting precision is paramount in preparing the author response. The document must adhere to a two-column layout with standardized font sizes and spacing. Section numbering, display equations, and figure captions have specified format criteria, ensuring uniformity and enhancing readability. Figures, if included, should be sized appropriately to maintain clarity when printed, as many reviewers opt to review hard copies rather than digital versions.

These formatting instructions facilitate a streamlined presentation that allows readers, particularly reviewers, to access pertinent information efficiently without navigating through extraneous content or inconsistent layouts.

Practical and Theoretical Implications

From a practical standpoint, this paper formalizes the rebuttal procedure, contributing to a robust and systematic submission framework for CVPR conferences. This has implications for the conference's ability to uphold academic rigor and integrity throughout the review process, leading to more refined and polished submissions.

Theoretically, the standardization of author responses can influence how authors approach feedback and criticism, fostering a culture of engagement and constructive dialogue within the research community. By focusing responses on factual correctness and clarifications, rather than the introduction of new experiments, researchers are encouraged to improve their communication skills and argumentative precision.

Future Developments

Looking forward, the outlined guidelines may set a precedent for other conferences and journals, promoting a uniform rebuttal protocol across various platforms. Furthermore, as digital submission systems evolve, there may be opportunities to streamline compliance with these guidelines through automated formatting tools or plugins, potentially reducing the administrative burden on authors without compromising on the quality of submissions.

In conclusion, these guidelines serve as crucial instructions for authors aiming to refine their engagement in the scholarly review process, emphasizing clarity, precision, and adherence to formatting standards in the preparation of author responses.