Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach (2010.07835v3)

Published 15 Oct 2020 in cs.CL and cs.LG

Abstract: Fine-tuned pre-trained language models (LMs) have achieved enormous success in many NLP tasks, but they still require excessive labeled data in the fine-tuning stage. We study the problem of fine-tuning pre-trained LMs using only weak supervision, without any labeled data. This problem is challenging because the high capacity of LMs makes them prone to overfitting the noisy labels generated by weak supervision. To address this problem, we develop a contrastive self-training framework, COSINE, to enable fine-tuning LMs with weak supervision. Underpinned by contrastive regularization and confidence-based reweighting, this contrastive self-training framework can gradually improve model fitting while effectively suppressing error propagation. Experiments on sequence, token, and sentence pair classification tasks show that our model outperforms the strongest baseline by large margins on 7 benchmarks in 6 tasks, and achieves competitive performance with fully-supervised fine-tuning methods.
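
The two ingredients the abstract names, confidence-based reweighting of pseudo-labels and contrastive regularization over sample pairs, can be illustrated with a short sketch. This is not the paper's COSINE implementation: the `model` interface (assumed to return logits and embeddings), the confidence threshold, the contrastive margin, and the weight `lam` are all illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def confidence_weights(probs, threshold=0.9):
    # Confidence-based reweighting (sketch): keep high-confidence pseudo-labels
    # and weight them by predicted confidence. The threshold is an illustrative
    # choice, not the paper's setting.
    conf, pseudo_labels = probs.max(dim=-1)
    weights = conf * (conf > threshold).float()
    return pseudo_labels, weights

def contrastive_regularizer(embeddings, pseudo_labels, weights, margin=1.0):
    # Contrastive regularization (sketch): pull together embeddings that share a
    # pseudo-label, push apart those that differ, with each pair weighted by the
    # confidence of both samples. The margin is an illustrative choice.
    dist = torch.cdist(embeddings, embeddings)                      # pairwise L2 distances
    same = (pseudo_labels[:, None] == pseudo_labels[None, :]).float()
    pair_w = weights[:, None] * weights[None, :]
    pos_loss = same * dist.pow(2)
    neg_loss = (1 - same) * F.relu(margin - dist).pow(2)
    return (pair_w * (pos_loss + neg_loss)).mean()

def self_training_step(model, batch, optimizer, lam=0.1):
    # One self-training update (sketch): pseudo-label the batch with the current
    # model, reweight by confidence, and add the contrastive term. COSINE's exact
    # update schedule differs; this only illustrates the loss structure.
    model.eval()
    with torch.no_grad():
        logits, _ = model(batch)            # assumed interface: (logits, embeddings)
        probs = F.softmax(logits, dim=-1)
    pseudo_labels, weights = confidence_weights(probs)

    model.train()
    logits, embeddings = model(batch)
    ce = F.cross_entropy(logits, pseudo_labels, reduction="none")
    loss = (weights * ce).mean() + lam * contrastive_regularizer(
        embeddings, pseudo_labels, weights
    )
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

The design intent is that low-confidence pseudo-labels contribute little to either term, which is how the framework suppresses error propagation while the model gradually fits the weakly labeled data.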

Authors (6)
  1. Yue Yu (343 papers)
  2. Simiao Zuo (25 papers)
  3. Haoming Jiang (52 papers)
  4. Wendi Ren (3 papers)
  5. Tuo Zhao (131 papers)
  6. Chao Zhang (909 papers)
Citations (122)
