Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Factorized Contrastive Learning: Going Beyond Multi-view Redundancy (2306.05268v2)

Published 8 Jun 2023 in cs.LG, cs.AI, cs.CL, cs.CV, and cs.MM

Abstract: In a wide range of multimodal tasks, contrastive learning has become a particularly appealing approach since it can successfully learn representations from abundant unlabeled data with only pairing information (e.g., image-caption or video-audio pairs). Underpinning these approaches is the assumption of multi-view redundancy - that shared information between modalities is necessary and sufficient for downstream tasks. However, in many real-world settings, task-relevant information is also contained in modality-unique regions: information that is only present in one modality but still relevant to the task. How can we learn self-supervised multimodal representations to capture both shared and unique information relevant to downstream tasks? This paper proposes FactorCL, a new multimodal representation learning method to go beyond multi-view redundancy. FactorCL is built from three new contributions: (1) factorizing task-relevant information into shared and unique representations, (2) capturing task-relevant information via maximizing MI lower bounds and removing task-irrelevant information via minimizing MI upper bounds, and (3) multimodal data augmentations to approximate task relevance without labels. On large-scale real-world datasets, FactorCL captures both shared and unique information and achieves state-of-the-art results on six benchmarks

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Paul Pu Liang (103 papers)
  2. Zihao Deng (20 papers)
  3. Martin Ma (3 papers)
  4. James Zou (232 papers)
  5. Louis-Philippe Morency (123 papers)
  6. Ruslan Salakhutdinov (248 papers)
Citations (34)
Github Logo Streamline Icon: https://streamlinehq.com