POUF: Prompt-oriented unsupervised fine-tuning for large pre-trained models (2305.00350v1)

Published 29 Apr 2023 in cs.LG, cs.AI, cs.CL, cs.CV, and stat.ML

Abstract: Through prompting, large-scale pre-trained models have become more expressive and powerful, gaining significant attention in recent years. Although these large models have zero-shot capabilities, labeled data are in general still required to adapt them to downstream tasks. To overcome this critical limitation, we propose an unsupervised fine-tuning framework to directly fine-tune the model or prompt on the unlabeled target data. We demonstrate how to apply our method to both language-augmented vision and masked-language models by aligning the discrete distributions extracted from the prompts and target data. To verify our approach's applicability, we conduct extensive experiments on image classification, sentiment analysis, and natural language inference tasks. Across 13 image-related tasks and 15 language-related ones, the proposed approach achieves consistent improvements over the baselines.
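
To make the abstract's alignment idea concrete, here is a minimal sketch of unsupervised prompt-based adaptation in the CLIP-style (language-augmented vision) setting. It is an illustration under assumptions, not the paper's exact objective: the function name `pouf_style_loss`, the temperature value, and the mutual-information-style formulation are hypothetical, and the paper's full method for aligning the discrete prompt and target distributions is not reproduced here.

```python
import torch

def pouf_style_loss(image_feats, prompt_feats, temperature=0.01):
    """Hypothetical sketch: form per-sample class distributions from
    prompt-image similarity and optimize them on unlabeled target data.

    image_feats:  (N, D) L2-normalized embeddings of unlabeled target images
    prompt_feats: (K, D) L2-normalized embeddings of K class prompts
                  (e.g., "a photo of a {class}")
    """
    # Discrete class distribution per sample from prompt-image similarity.
    logits = image_feats @ prompt_feats.t() / temperature   # (N, K)
    p = logits.softmax(dim=-1)

    # Encourage confident per-sample predictions: low conditional entropy H(Y|X).
    cond_ent = -(p * (p + 1e-8).log()).sum(dim=-1).mean()

    # Discourage collapse onto a single class: high marginal entropy H(Y).
    marginal = p.mean(dim=0)                                # (K,)
    marg_ent = -(marginal * (marginal + 1e-8).log()).sum()

    # Mutual-information-style objective: minimize H(Y|X) - H(Y).
    return cond_ent - marg_ent
```

In use, such a loss would be minimized over the unlabeled target set with gradients flowing into either the prompt embeddings or the model parameters, matching the abstract's "fine-tune the model or prompt" framing.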

Authors (5)
  1. Korawat Tanwisuth (7 papers)
  2. Shujian Zhang (28 papers)
  3. Huangjie Zheng (34 papers)
  4. Pengcheng He (60 papers)
  5. Mingyuan Zhou (161 papers)
Citations (23)