Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Enhancing Black-Box Few-Shot Text Classification with Prompt-Based Data Augmentation (2305.13785v2)

Published 23 May 2023 in cs.CL

Abstract: Training or finetuning large-scale LLMs such as GPT-3 requires substantial computation resources, motivating recent efforts to explore parameter-efficient adaptation to downstream tasks. One practical area of research is to treat these models as black boxes and interact with them through their inference APIs. In this paper, we investigate how to optimize few-shot text classification without accessing the gradients of the LLMs. To achieve this, we treat the black-box model as a feature extractor and train a classifier with the augmented text data. Data augmentation is performed using prompt-based finetuning on an auxiliary LLM with a much smaller parameter size than the black-box model. Through extensive experiments on eight text classification datasets, we show that our approach, dubbed BT-Classifier, significantly outperforms state-of-the-art black-box few-shot learners and performs on par with methods that rely on full-model tuning.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Danqing Luo (3 papers)
  2. Chen Zhang (403 papers)
  3. Jiahui Xu (21 papers)
  4. Bin Wang (750 papers)
  5. Yiming Chen (106 papers)
  6. Yan Zhang (954 papers)
  7. Haizhou Li (285 papers)