Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Contrastive Learning Is Spectral Clustering On Similarity Graph (2303.15103v4)

Published 27 Mar 2023 in cs.LG, cs.AI, and cs.CV

Abstract: Contrastive learning is a powerful self-supervised learning method, but we have a limited theoretical understanding of how it works and why it works. In this paper, we prove that contrastive learning with the standard InfoNCE loss is equivalent to spectral clustering on the similarity graph. Using this equivalence as the building block, we extend our analysis to the CLIP model and rigorously characterize how similar multi-modal objects are embedded together. Motivated by our theoretical insights, we introduce the Kernel-InfoNCE loss, incorporating mixtures of kernel functions that outperform the standard Gaussian kernel on several vision datasets. The code is available at https://github.com/yifanzhang-pro/Kernel-InfoNCE.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Zhiquan Tan (20 papers)
  2. Yifan Zhang (245 papers)
  3. Jingqin Yang (6 papers)
  4. Yang Yuan (52 papers)
Citations (15)

Summary

We haven't generated a summary for this paper yet.