kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies (2404.09447v3)

Published 15 Apr 2024 in cs.CV and cs.LG

Abstract: Continual segmentation has not yet tackled the challenge of improving open-vocabulary segmentation models with training data for accurate segmentation across large, continually expanding vocabularies. We discover that traditional continual training results in severe catastrophic forgetting, failing to outperform a zero-shot segmentation baseline. We introduce a novel training-free strategy, kNN-CLIP, which augments the model with a database of instance embeddings for semantic and panoptic segmentation that achieves zero forgetting. We demonstrate that kNN-CLIP can adapt to continually growing vocabularies without the need for retraining or large memory costs. kNN-CLIP enables open-vocabulary segmentation methods to expand their vocabularies on any domain with a single pass through the data, while only storing compact embeddings. This approach minimizes both compute and memory costs. kNN-CLIP achieves state-of-the-art performance across large-vocabulary semantic and panoptic segmentation datasets. We hope kNN-CLIP represents a significant step forward in enabling more efficient and adaptable continual segmentation, paving the way for advances in real-world large-vocabulary continual segmentation methods.

References (79)

Authors (8)

Zhongrui Gui (2 papers)
Shuyang Sun (25 papers)
Runjia Li (16 papers)
Jianhao Yuan (10 papers)
Zhaochong An (11 papers)
Karsten Roth (36 papers)
Ameya Prabhu (37 papers)
Philip Torr (172 papers)

Citations (3)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/CSVisionPapers/status/1780642884624007218

kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies (2404.09447v3)

Summary

Related Papers

Tweets