2000 character limit reached
Contrastive Learning with Nasty Noise (2502.17872v1)
Published 25 Feb 2025 in cs.LG and cs.AI
Abstract: Contrastive learning has emerged as a powerful paradigm for self-supervised representation learning. This work analyzes the theoretical limits of contrastive learning under nasty noise, where an adversary modifies or replaces training samples. Using PAC learning and VC-dimension analysis, lower and upper bounds on sample complexity in adversarial settings are established. Additionally, data-dependent sample complexity bounds based on the l2-distance function are derived.