
Semi-supervised Batch Active Learning via Bilevel Optimization (2010.09654v1)

Published 19 Oct 2020 in cs.LG, cs.SD, and stat.ML

Abstract: Active learning is an effective technique for reducing the labeling cost by improving data efficiency. In this work, we propose a novel batch acquisition strategy for active learning in the setting where model training is performed in a semi-supervised manner. We formulate our approach as a data summarization problem via bilevel optimization, where the queried batch consists of the points that best summarize the unlabeled data pool. We show that our method is highly effective in keyword detection tasks in the regime where only a few labeled samples are available.

Authors (3)
  1. Zalán Borsos (18 papers)
  2. Marco Tagliasacchi (37 papers)
  3. Andreas Krause (269 papers)
Citations (23)
