Zero-Shot Text Classification via Self-Supervised Tuning (2305.11442v2)

Published 19 May 2023 in cs.CL, cs.AI, and cs.LG

Abstract: Existing solutions to zero-shot text classification either conduct prompting with pre-trained LLMs, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this work, we propose a new paradigm based on self-supervised learning to solve zero-shot text classification tasks by tuning the LLMs with unlabeled data, called self-supervised tuning. By exploring the inherent structure of free texts, we propose a new learning objective called first sentence prediction to bridge the gap between unlabeled data and text classification tasks. After tuning the model to learn to predict the first sentence in a paragraph based on the rest, the model is able to conduct zero-shot inference on unseen tasks such as topic classification and sentiment analysis. Experimental results show that our model outperforms the state-of-the-art baselines on 7 out of 10 tasks. Moreover, the analysis reveals that our model is less sensitive to the prompt design. Our code and pre-trained models are publicly available at https://github.com/DAMO-NLP-SG/SSTuning.
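
The abstract describes the first-sentence-prediction (FSP) objective only at a high level: each unlabeled paragraph is turned into a self-supervised classification example whose correct answer is that paragraph's own first sentence. Below is a minimal sketch of how such training and zero-shot inputs might be constructed. The option markers (A)/(B)/(C)/(D), the [SEP] separator, and all helper names are illustrative assumptions, not the paper's exact format; the authors' implementation is in the linked repository.

```python
import random

# Assumed option markers; the real SSTuning format may differ.
INDEX_TOKENS = ["(A)", "(B)", "(C)", "(D)"]

def build_fsp_example(paragraph: str, distractor_firsts: list[str]) -> tuple[str, int]:
    """Turn one unlabeled paragraph into a multiple-choice FSP example.

    The paragraph's own first sentence is the correct option; first
    sentences drawn from other paragraphs act as distractors.
    """
    sentences = paragraph.split(". ")
    first, rest = sentences[0], ". ".join(sentences[1:])
    options = distractor_firsts[: len(INDEX_TOKENS) - 1] + [first]
    random.shuffle(options)
    gold = options.index(first)  # index of the true first sentence
    option_str = " ".join(f"{tok} {opt}" for tok, opt in zip(INDEX_TOKENS, options))
    return f"{option_str} [SEP] {rest}", gold

def build_zero_shot_input(text: str, label_descriptions: list[str]) -> str:
    """At inference, verbalized class labels fill the option slots instead."""
    option_str = " ".join(
        f"{tok} {desc}" for tok, desc in zip(INDEX_TOKENS, label_descriptions)
    )
    return f"{option_str} [SEP] {text}"

# Usage sketch
para = "The match ended in a draw. Both teams struggled to convert chances."
x, y = build_fsp_example(para, ["Stocks fell sharply today.", "A new phone launched."])
print(x, "-> gold option", y)
print(build_zero_shot_input("The striker scored twice.",
                            ["This text is about sports.", "This text is about finance."]))
```

Under this framing, the label options occupy the same input slot at tuning time and at inference time, which is plausibly why the tuned model transfers to unseen classification tasks and, as the abstract reports, is less sensitive to prompt design.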

Authors (7)
  1. Chaoqun Liu (38 papers)
  2. Wenxuan Zhang (75 papers)
  3. Guizhen Chen (11 papers)
  4. Xiaobao Wu (43 papers)
  5. Anh Tuan Luu (69 papers)
  6. Chip Hong Chang (4 papers)
  7. Lidong Bing (144 papers)
Citations (10)
