
Revisiting Pretraining for Semi-Supervised Learning in the Low-Label Regime (2205.03001v1)

Published 6 May 2022 in cs.CV

Abstract: Semi-supervised learning (SSL) addresses the lack of labeled data by exploiting large amounts of unlabeled data through pseudo-labeling. However, in the extremely low-label regime the pseudo labels can be incorrect (the so-called confirmation bias), and these incorrect pseudo labels in turn harm network training. Recent studies combined finetuning (FT) from pretrained weights with SSL to mitigate these challenges and claimed superior results in the low-label regime. In this work, we first show that the better pretrained weights brought in by FT account for the state-of-the-art performance, and, importantly, that they are universally helpful to off-the-shelf semi-supervised learners. We further argue that direct finetuning from pretrained weights is suboptimal due to covariate shift and propose a contrastive target pretraining step to adapt the model weights towards the target dataset. We carried out extensive experiments on both classification and segmentation tasks, performing target pretraining followed by semi-supervised finetuning. The promising results validate the efficacy of target pretraining for SSL, in particular in the low-label regime.
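
The abstract describes a two-stage recipe: contrastive pretraining on the target (unlabeled) data to adapt generically pretrained weights, followed by semi-supervised finetuning. The sketch below illustrates that pipeline in PyTorch under common stand-in choices; the SimCLR-style NT-Xent loss, FixMatch-style confidence thresholding, ResNet-18 backbone, and all hyperparameters are assumptions for illustration, not the authors' exact method.

```python
# Illustrative sketch only; losses, backbone, and hyperparameters are assumed,
# not taken from the paper.
import torch
import torch.nn.functional as F
from torchvision.models import resnet18


def nt_xent(z1, z2, temperature=0.5):
    """SimCLR-style contrastive loss over two augmented views of the same batch."""
    z = F.normalize(torch.cat([z1, z2], dim=0), dim=1)        # (2B, D)
    sim = z @ z.t() / temperature                              # pairwise similarities
    n = z1.size(0)
    sim.masked_fill_(torch.eye(2 * n, dtype=torch.bool, device=z.device), float("-inf"))
    targets = torch.cat([torch.arange(n, 2 * n), torch.arange(n)]).to(z.device)
    return F.cross_entropy(sim, targets)                       # positive: the other view


def target_contrastive_pretrain(encoder, unlabeled_loader, epochs=10, lr=1e-3):
    """Stage 1: adapt generically pretrained weights to the target distribution."""
    proj = torch.nn.Linear(512, 128)                            # projection head (assumed sizes)
    opt = torch.optim.Adam(list(encoder.parameters()) + list(proj.parameters()), lr=lr)
    for _ in range(epochs):
        for view1, view2 in unlabeled_loader:                   # two augmentations per image
            loss = nt_xent(proj(encoder(view1)), proj(encoder(view2)))
            opt.zero_grad(); loss.backward(); opt.step()
    return encoder


def semi_supervised_finetune(encoder, classifier, labeled_loader, unlabeled_loader,
                             epochs=10, lr=1e-3, threshold=0.95):
    """Stage 2: finetune with the few labels plus confidence-filtered pseudo labels."""
    opt = torch.optim.Adam(list(encoder.parameters()) + list(classifier.parameters()), lr=lr)
    for _ in range(epochs):
        for (x_l, y_l), (x_weak, x_strong) in zip(labeled_loader, unlabeled_loader):
            sup_loss = F.cross_entropy(classifier(encoder(x_l)), y_l)
            with torch.no_grad():                               # pseudo-label the weak view
                probs = F.softmax(classifier(encoder(x_weak)), dim=1)
                conf, pseudo = probs.max(dim=1)
            mask = (conf >= threshold).float()                  # keep only confident pseudo labels
            unsup = F.cross_entropy(classifier(encoder(x_strong)), pseudo, reduction="none")
            loss = sup_loss + (unsup * mask).mean()
            opt.zero_grad(); loss.backward(); opt.step()
    return encoder, classifier


# Usage: start from pretrained weights, adapt to the target data, then finetune.
encoder = resnet18(weights="IMAGENET1K_V1")
encoder.fc = torch.nn.Identity()                                # expose 512-d features
classifier = torch.nn.Linear(512, 10)                           # 10 classes assumed
# encoder = target_contrastive_pretrain(encoder, unlabeled_pair_loader)
# encoder, classifier = semi_supervised_finetune(encoder, classifier,
#                                                labeled_loader, fixmatch_loader)
```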

Authors (8)
  1. Xun Xu (64 papers)
  2. Jingyi Liao (20 papers)
  3. Lile Cai (4 papers)
  4. Manh Cuong Nguyen (21 papers)
  5. Kangkang Lu (7 papers)
  6. Wanyue Zhang (9 papers)
  7. Yasin Yazici (6 papers)
  8. Chuan Sheng Foo (15 papers)
Citations (4)
