Plug and Play Active Learning for Object Detection (2211.11612v2)
Abstract: Annotating datasets for object detection is an expensive and time-consuming endeavor. To minimize this burden, active learning (AL) techniques are employed to select the most informative samples for annotation within a constrained "annotation budget". Traditional AL strategies typically rely on model uncertainty or sample diversity for query sampling, while more advanced methods have focused on developing AL-specific object detector architectures to enhance performance. However, these specialized approaches are not readily adaptable to different object detectors due to the significant engineering effort required for integration. To overcome this challenge, we introduce Plug and Play Active Learning (PPAL), a simple and effective AL strategy for object detection. PPAL is a two-stage method comprising uncertainty-based and diversity-based sampling phases. In the first stage, our Difficulty Calibrated Uncertainty Sampling leverage a category-wise difficulty coefficient that combines both classification and localisation difficulties to re-weight instance uncertainties, from which we sample a candidate pool for the subsequent diversity-based sampling. In the second stage, we propose Category Conditioned Matching Similarity to better compute the similarities of multi-instance images as ensembles of their instance similarities, which is used by the k-Means++ algorithm to sample the final AL queries. PPAL makes no change to model architectures or detector training pipelines; hence it can be easily generalized to different object detectors. We benchmark PPAL on the MS-COCO and Pascal VOC datasets using different detector architectures and show that our method outperforms prior work by a large margin. Code is available at https://github.com/ChenhongyiYang/PPAL
- Contextual diversity for active learning. In ECCV, 2020.
- Deep batch active learning by diverse, uncertain gradient lower bounds. ICLR, 2020.
- Link-based active learning. In NeurIPS Workshop on Analyzing Networks and Learning with Graphs, 2009.
- Learning a Unified Sample Weighting Network for Object Detection. In CVPR, 2020.
- Cascade r-cnn: Delving into high quality object detection. In CVPR, 2018.
- End-to-End Object Detection with Transformers. ECCV, 2020.
- Hybrid task cascade for instance segmentation. In CVPR, 2019.
- MMDetection: Open mmlab detection toolbox and benchmark. arXiv preprint arXiv:1906.07155, 2019.
- Disentangle your dense object detector. In ACM Multimedia, 2021.
- Active learning for deep object detection via probabilistic modeling. In ICCV, 2021.
- Batch active learning at scale. ICLR, 2021.
- Not all labels are equal: Rationalizing the labeling costs for training object detection. In CVPR, 2022.
- A convex optimization framework for active learning. In ICCV, 2013.
- The Pascal Visual Object Classes Challenge: A Retrospective. IJCV, 2015.
- Deep bayesian active learning with image data. In ICML. PMLR, 2017.
- Yuhong Guo. Active instance sampling via matrix partition. NeurIPS, 2010.
- Mask r-cnn. In ICCV, 2017.
- Deep residual learning for image recognition. In CVPR, 2016.
- DenseBox: Unifying landmark localization with end to end object detection. arXiv preprint arXiv:1509.04874, 2015.
- Multi-class active learning for image classification. In CVPR, 2009.
- The open images dataset v4. IJCV, 128(7):1956–1981, 2020.
- Heterogeneous uncertainty sampling for supervised learning. In Machine learning proceedings, pages 148–156. Elsevier, 1994.
- A sequential algorithm for training text classifiers. In SIGIR, pages 3–12. Springer, 1994.
- Dn-detr: Accelerate detr training by introducing query denoising. In CVPR, 2022.
- Focal loss for dense object detection. In ICCV, 2017.
- Microsoft coco: Common objects in context. In ECCV, 2014.
- Ssd: Single shot multibox detector. In ECCV, 2016.
- Swin transformer: Hierarchical vision transformer using shifted windows. In ICCV, pages 10012–10022, 2021.
- Latent structured active learning. NeurIPS, 2013.
- Box-level active detection. In CVPR, 2023.
- Hierarchical subquery evaluation for active learning on a graph. In CVPR, 2014.
- Active domain adaptation via clustering uncertainty-weighted embeddings. In ICCV, 2021.
- Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767, 2018.
- Faster r-cnn: Towards real-time object detection with region proposal networks. In NeurIPS, 2015.
- Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression. In CVPR, 2019.
- Margin-based active learning for structured output spaces. In ECML. Springer, 2006.
- ImageNet Large Scale Visual Recognition Challenge. IJCV, 2015.
- Mobilenetv2: Inverted Residuals and Linear Bottlenecks. In CVPR, 2018.
- Active learning for convolutional neural networks: A core-set approach. ICLR, 2018.
- Burr Settles. Active learning. Synthesis lectures on artificial intelligence and machine learning, 6(1):1–114, 2012.
- Fcos: Fully convolutional one-stage object detection. In ICCV, 2019.
- Cost-effective active learning for deep image classification. T-CSVT, 27(12):2591–2600, 2016.
- Entropy-based active learning for object detection with progressive diversity constraint. In CVPR, 2022.
- Redal: Region-based and diversity-aware active learning for point cloud semantic segmentation. In ICCV, 2021.
- End-to-end semi-supervised object detection with soft teacher. In ICCV, pages 3060–3069, October 2021.
- Contrastive object-level pre-training with spatial noise curriculum learning. arXiv preprint arXiv:2111.13651, 2021.
- Multi-class active learning by uncertainty sampling with diversity maximization. IJCV, 113(2):113–127, 2015.
- Learning loss for active learning. In CVPR, 2019.
- Consistency-based active learning for object detection. In CVPR Workshops, 2022.
- Multiple instance active learning for object detection. In CVPR, 2021.
- Dino: Detr with improved denoising anchor boxes for end-to-end object detection. ECCV, 2022.
- Bridging the gap between anchor-based and anchor-free detection via adaptive training sample selection. In CVPR, 2020.
- Fedor Zhdanov. Diverse mini-batch active learning. arXiv preprint arXiv:1901.05954, 2019.
- Deformable detr: Deformable transformers for end-to-end object detection. In ICLR, 2020.
- Chenhongyi Yang (14 papers)
- Lichao Huang (28 papers)
- Elliot J. Crowley (27 papers)