Incremental Self-training for Semi-supervised Learning (2404.12398v1)

Published 14 Apr 2024 in cs.LG

Abstract: Semi-supervised learning reduces the dependency of machine learning on labeled data. As one of the most efficient semi-supervised techniques, self-training (ST) has received increasing attention, and several advancements have emerged to address the challenges associated with noisy pseudo-labels. Previous works on self-training acknowledge the importance of unlabeled data but have not explored their efficient utilization, nor have they addressed the high time cost of iterative learning. This paper proposes Incremental Self-training (IST) for semi-supervised learning to fill these gaps. Unlike ST, which processes all data indiscriminately, IST processes data in batches and preferentially assigns pseudo-labels to unlabeled samples with high certainty. It then processes the data around the decision boundary after the model has stabilized, enhancing classifier performance. IST is simple yet effective and fits existing self-training-based semi-supervised learning methods. We verify the proposed IST on five datasets and two types of backbones, improving both recognition accuracy and learning speed. Notably, it outperforms state-of-the-art competitors on three challenging image classification tasks.

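The abstract describes a confidence-ordered, batched pseudo-labeling loop: high-certainty unlabeled samples are labeled first, and samples near the decision boundary are deferred until the model stabilizes. The sketch below illustrates that general idea, not the authors' exact IST algorithm; the function name, the scikit-learn backbone, and all hyperparameters (`batch_frac`, `conf_threshold`, `max_rounds`) are illustrative assumptions.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def incremental_self_training(X_lab, y_lab, X_unlab,
                              batch_frac=0.2, conf_threshold=0.9, max_rounds=10):
    """Minimal confidence-ordered self-training sketch (not the paper's exact IST).

    Unlabeled samples are pseudo-labeled in batches, most confident first;
    low-confidence samples near the decision boundary are only absorbed in
    later rounds, once earlier rounds no longer meet the confidence threshold.
    """
    model = LogisticRegression(max_iter=1000)
    X_train, y_train = X_lab.copy(), y_lab.copy()
    remaining = X_unlab.copy()

    for _ in range(max_rounds):
        if len(remaining) == 0:
            break
        model.fit(X_train, y_train)

        probs = model.predict_proba(remaining)
        conf = probs.max(axis=1)
        order = np.argsort(-conf)                      # most certain first
        batch_size = max(1, int(batch_frac * len(remaining)))
        picked = order[:batch_size]
        confident = picked[conf[picked] >= conf_threshold]
        if len(confident) > 0:
            picked = confident                          # early rounds: high-certainty only
        # otherwise only boundary samples remain; keep the batch as selected

        pseudo = probs[picked].argmax(axis=1)
        X_train = np.vstack([X_train, remaining[picked]])
        y_train = np.concatenate([y_train, pseudo])
        remaining = np.delete(remaining, picked, axis=0)

    model.fit(X_train, y_train)
    return model
```

Processing the pool in confidence-ordered batches, rather than pseudo-labeling everything at once, is what the abstract credits for both the accuracy gain (boundary samples are labeled by a more stable model) and the speed-up (each round refits on a modest increment instead of the full unlabeled set).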
Authors (4)
  1. Jifeng Guo (5 papers)
  2. Zhulin Liu (4 papers)
  3. Tong Zhang (569 papers)
  4. C. L. Philip Chen (49 papers)
Citations (1)