Enhancing Self-Training Methods (2301.07294v1)

Published 18 Jan 2023 in cs.LG, cs.AI, and cs.CV

Abstract: Semi-supervised learning approaches train on small sets of labeled data along with large sets of unlabeled data. Self-training is a semi-supervised teacher-student approach that often suffers from the problem of "confirmation bias" that occurs when the student model repeatedly overfits to incorrect pseudo-labels given by the teacher model for the unlabeled data. This bias impedes improvements in pseudo-label accuracy across self-training iterations, leading to unwanted saturation in model performance after just a few iterations. In this work, we describe multiple enhancements to improve the self-training pipeline to mitigate the effect of confirmation bias. We evaluate our enhancements over multiple datasets showing performance gains over existing self-training design choices. Finally, we also study the extendability of our enhanced approach to Open Set unlabeled data (containing classes not seen in labeled data).
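To make the teacher-student loop described above concrete, here is a minimal sketch of a generic self-training (pseudo-labeling) iteration. It is not the paper's exact pipeline; the scikit-learn model, the `confidence_threshold` value, and the toy data are illustrative assumptions. The point where confident but incorrect pseudo-labels get folded back into training is exactly where the "confirmation bias" the paper targets arises.

```python
# Minimal self-training sketch (generic pseudo-labeling loop, not the paper's method).
# `confidence_threshold` and the scikit-learn classifier are illustrative assumptions.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

# Toy data: a small labeled set and a large unlabeled set.
X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
X_lab, X_unlab, y_lab, _ = train_test_split(X, y, train_size=0.05, random_state=0)

confidence_threshold = 0.9  # assumed value; selection strategy is a key design choice

for iteration in range(5):
    # Teacher: fit on the current labeled (+ previously pseudo-labeled) pool.
    teacher = LogisticRegression(max_iter=1000).fit(X_lab, y_lab)

    # Pseudo-label the unlabeled pool and keep only confident predictions.
    probs = teacher.predict_proba(X_unlab)
    confident = probs.max(axis=1) >= confidence_threshold
    if not confident.any():
        break

    # Student step: retrain on labeled + confident pseudo-labeled data.
    # Repeatedly trusting wrong pseudo-labels here drives confirmation bias.
    X_lab = np.vstack([X_lab, X_unlab[confident]])
    y_lab = np.concatenate([y_lab, probs[confident].argmax(axis=1)])
    X_unlab = X_unlab[~confident]
```

In practice the teacher and student are typically deep networks retrained (or fine-tuned) each round; the enhancements studied in the paper aim to keep pseudo-label accuracy improving across these iterations instead of saturating.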

Authors (6)
  1. Aswathnarayan Radhakrishnan (1 paper)
  2. Jim Davis (13 papers)
  3. Zachary Rabin (3 papers)
  4. Benjamin Lewis (5 papers)
  5. Matthew Scherreik (3 papers)
  6. Roman Ilin (6 papers)
Citations (1)