BAL: Balancing Diversity and Novelty for Active Learning (2312.15944v1)
Abstract: The objective of Active Learning is to strategically label a subset of the dataset so as to maximize performance within a predetermined labeling budget. In this study, we harness features acquired through self-supervised learning. We introduce a straightforward yet potent metric, Cluster Distance Difference, to identify diverse data. We then propose a novel framework, Balancing Active Learning (BAL), which constructs adaptive sub-pools to balance diverse and uncertain data. Our approach outperforms all established active learning methods on widely recognized benchmarks by 1.20%. Moreover, we assess the efficacy of the proposed framework under extended settings, encompassing both larger and smaller labeling budgets. Experimental results demonstrate that, when labeling 80% of the samples, the performance of the current SOTA method declines by 0.74%, whereas our proposed BAL achieves performance comparable to training on the full dataset. Code is available at https://github.com/JulietLJY/BAL.
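The abstract does not spell out how Cluster Distance Difference is computed. As a purely illustrative sketch, one could *assume* a margin-style definition: the gap between a sample's distances to its second-nearest and nearest cluster centers in the self-supervised feature space, with small gaps flagging samples that sit between clusters. The function names (`cdd`, `select_diverse`), the specific formula, and the selection rule below are all assumptions for illustration, not the paper's actual metric:

```python
import math

def cdd(x, centers):
    """Hypothetical Cluster Distance Difference: the gap between the
    distances from point x to its second-nearest and nearest cluster
    centers. A small gap means x lies near a cluster boundary."""
    d = sorted(math.dist(x, c) for c in centers)
    return d[1] - d[0]

def select_diverse(pool, centers, k):
    """Pick the k pool points with the smallest CDD, i.e. the points
    most ambiguous between clusters (one plausible diversity criterion)."""
    return sorted(pool, key=lambda x: cdd(x, centers))[:k]

# Toy 2-D "features" with two cluster centers.
centers = [(0.0, 0.0), (10.0, 0.0)]
pool = [(1.0, 0.0), (5.0, 0.0), (9.0, 0.0), (4.5, 1.0)]
picked = select_diverse(pool, centers, 2)  # the two mid-gap points
```

In a real pipeline, `centers` would come from clustering (e.g. k-means) of self-supervised embeddings rather than being fixed by hand, and the CDD-selected sub-pool would be balanced against an uncertainty-based sub-pool as the abstract describes.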