Enhancing Black-Box Few-Shot Text Classification with Prompt-Based Data Augmentation (2305.13785v2)
Abstract: Training or finetuning LLMs such as GPT-3 requires substantial computational resources, motivating recent efforts to explore parameter-efficient adaptation to downstream tasks. One practical line of research treats these models as black boxes and interacts with them only through their inference APIs. In this paper, we investigate how to optimize few-shot text classification without accessing the gradients of the LLMs. To achieve this, we treat the black-box model as a feature extractor and train a classifier on augmented text data. Data augmentation is performed using prompt-based finetuning on an auxiliary LLM with far fewer parameters than the black-box model. Through extensive experiments on eight text classification datasets, we show that our approach, dubbed BT-Classifier, significantly outperforms state-of-the-art black-box few-shot learners and performs on par with methods that rely on full-model tuning.
- Danqing Luo
- Chen Zhang
- Jiahui Xu
- Bin Wang
- Yiming Chen
- Yan Zhang
- Haizhou Li
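
The pipeline outlined in the abstract (augment the few-shot data, extract features from a frozen black-box model, train a small classifier on those features) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and not the authors' implementation: `extract_features` is a hypothetical stand-in for a black-box inference API (a hashed bag of words replaces real LLM features), `augment` is a placeholder for the prompt-finetuned auxiliary model (it only applies fixed templates), and a nearest-centroid rule stands in for whatever classifier the paper actually trains.

```python
import hashlib
import math
from collections import defaultdict

DIM = 64  # assumed feature size; a real black-box model would fix this


def extract_features(text):
    """Hypothetical stand-in for a black-box feature API.

    Only the returned vector is visible to the caller; no gradients or
    parameters are exposed. Here: an L2-normalised hashed bag of words.
    """
    vec = [0.0] * DIM
    for token in text.lower().split():
        digest = hashlib.md5(token.encode("utf-8")).hexdigest()
        vec[int(digest, 16) % DIM] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def augment(text, label):
    """Placeholder for the auxiliary generator (assumption).

    The paper finetunes a smaller LLM with prompts to produce new text;
    this sketch only wraps the input in two fixed templates.
    """
    return [(f"{text} overall", label), (f"honestly , {text}", label)]


def build_training_set(few_shot):
    """Combine the original few-shot examples with their augmentations."""
    examples = list(few_shot)
    for text, label in few_shot:
        examples.extend(augment(text, label))
    return examples


def train_centroids(examples):
    """Train the lightweight classifier: one mean feature vector per label."""
    sums = defaultdict(lambda: [0.0] * DIM)
    counts = defaultdict(int)
    for text, label in examples:
        feats = extract_features(text)
        sums[label] = [s + f for s, f in zip(sums[label], feats)]
        counts[label] += 1
    return {lab: [s / counts[lab] for s in vec] for lab, vec in sums.items()}


def predict(centroids, text):
    """Return the label whose centroid has the largest dot product."""
    feats = extract_features(text)

    def score(label):
        return sum(c * f for c, f in zip(centroids[label], feats))

    return max(centroids, key=score)


def demo():
    """Run the sketch end to end on two toy examples."""
    few_shot = [("great movie", "pos"), ("terrible movie", "neg")]
    train_set = build_training_set(few_shot)
    centroids = train_centroids(train_set)
    return train_set, centroids, predict(centroids, "great film")
```

The design point the sketch tries to capture is that only the small classifier is trained: the feature extractor is queried but never updated, which is what makes the setting black-box.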