Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

ActiveGLAE: A Benchmark for Deep Active Learning with Transformers (2306.10087v1)

Published 16 Jun 2023 in cs.LG and cs.AI

Abstract: Deep active learning (DAL) seeks to reduce annotation costs by enabling the model to actively query instance annotations from which it expects to learn the most. Despite extensive research, there is currently no standardized evaluation protocol for transformer-based LLMs in the field of DAL. Diverse experimental settings lead to difficulties in comparing research and deriving recommendations for practitioners. To tackle this challenge, we propose the ActiveGLAE benchmark, a comprehensive collection of data sets and evaluation guidelines for assessing DAL. Our benchmark aims to facilitate and streamline the evaluation process of novel DAL strategies. Additionally, we provide an extensive overview of current practice in DAL with transformer-based LLMs. We identify three key challenges - data set selection, model training, and DAL settings - that pose difficulties in comparing query strategies. We establish baseline results through an extensive set of experiments as a reference point for evaluating future work. Based on our findings, we provide guidelines for researchers and practitioners.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Lukas Rauch (12 papers)
  2. Matthias Aßenmacher (20 papers)
  3. Denis Huseljic (16 papers)
  4. Moritz Wirth (8 papers)
  5. Bernd Bischl (136 papers)
  6. Bernhard Sick (97 papers)
Citations (11)

Summary

We haven't generated a summary for this paper yet.