ActiveLLM: Large Language Model-based Active Learning for Textual Few-Shot Scenarios (2405.10808v1)
Abstract: Active learning is designed to minimize annotation effort by prioritizing the instances that most enhance learning. However, many active learning strategies struggle with a 'cold start' problem, requiring substantial initial data before they become effective. This limitation often reduces their utility for pre-trained models, which already perform well in few-shot scenarios. To address this, we introduce ActiveLLM, a novel active learning approach that leverages LLMs such as GPT-4, Llama 3, and Mistral Large to select instances for annotation. We demonstrate that ActiveLLM significantly enhances the classification performance of BERT classifiers in few-shot scenarios, outperforming both traditional active learning methods and the few-shot learning method SetFit. Additionally, ActiveLLM can be extended to non-few-shot scenarios, allowing for iterative selection. In this way, ActiveLLM can even help other active learning strategies overcome their cold start problem. Our results suggest that ActiveLLM offers a promising solution for improving model performance across various learning setups.
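To make the selection step the abstract describes more concrete, here is a minimal sketch of LLM-based instance selection: an LLM is prompted to pick the few unlabeled texts whose annotation would most help a downstream few-shot classifier. The prompt wording, the `select_instances` helper, and the use of the OpenAI chat API are illustrative assumptions, not the paper's exact setup.

```python
# Hypothetical sketch of ActiveLLM-style instance selection.
# The paper's actual prompts and parsing may differ. Requires `pip install openai`.
import json
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def select_instances(pool: list[str], k: int, model: str = "gpt-4") -> list[int]:
    """Ask an LLM which k unlabeled texts would be most informative to
    annotate for few-shot training (the core idea of the approach)."""
    numbered = "\n".join(f"{i}: {text}" for i, text in enumerate(pool))
    prompt = (
        "You are an active learning selector for text classification.\n"
        f"From the {len(pool)} unlabeled instances below, choose the {k} most "
        "informative and diverse ones to annotate for few-shot training.\n"
        "Reason step by step, then finish with a JSON list of indices only.\n\n"
        f"{numbered}"
    )
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
    )
    reply = response.choices[0].message.content
    # Extract the final JSON list of indices, ignoring any reasoning text.
    start, end = reply.rfind("["), reply.rfind("]")
    return json.loads(reply[start : end + 1])[:k]

# Usage (hypothetical): indices = select_instances(unlabeled_texts, k=16)
# The selected texts are then labeled by an annotator and used to fine-tune
# a BERT classifier in the usual way.
```

In a non-few-shot setup, this selection could be repeated over the remaining pool after each labeling round, which is how the abstract suggests ActiveLLM can also bootstrap other active learning strategies past their cold start.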
- Markus Bayer
- Christian Reuter