Learning Loss for Active Learning

Published 9 May 2019 in cs.CV and cs.LG | (1905.03677v1)

Abstract: The performance of deep neural networks improves with more annotated data. The problem is that the budget for annotation is limited. One solution to this is active learning, where a model asks human to annotate data that it perceived as uncertain. A variety of recent methods have been proposed to apply active learning to deep networks but most of them are either designed specific for their target tasks or computationally inefficient for large networks. In this paper, we propose a novel active learning method that is simple but task-agnostic, and works efficiently with the deep networks. We attach a small parametric module, named "loss prediction module," to a target network, and learn it to predict target losses of unlabeled inputs. Then, this module can suggest data that the target model is likely to produce a wrong prediction. This method is task-agnostic as networks are learned from a single loss regardless of target tasks. We rigorously validate our method through image classification, object detection, and human pose estimation, with the recent network architectures. The results demonstrate that our method consistently outperforms the previous methods over the tasks.

Abstract PDF Upgrade to Chat

Authors (2)

Citations (600)

View on Semantic Scholar

Summary

The paper demonstrates a novel loss prediction module that identifies high-error samples to optimize data annotation for deep neural networks.
The methodology involves joint training of a target network and a loss predictor, enabling application across image classification, object detection, and pose estimation tasks.
Empirical results show improvements in metrics like accuracy, mAP, and PCKh, evidencing reduced annotation efforts and enhanced performance.

Learning Loss for Active Learning: An Expert Overview

The paper "Learning Loss for Active Learning" addresses the critical issue of data annotation costs in deep neural networks by proposing an innovative approach to active learning. The primary contribution is a task-agnostic method that enhances model performance with a minimal amount of annotated data. This is achieved through the introduction of a "loss prediction module" which efficiently identifies data points that are likely to result in high losses, thereby targeting the most informative samples for annotation.

Methodology

The authors attach a small, parametric loss prediction module to a target neural network. This module predicts the expected loss of unlabeled inputs, guiding the selection of data points that might lead to erroneous predictions by the main model. Unlike previous methods, which are often task-specific and computationally intensive, this approach is both simple and applicable across various tasks without the need for specific engineering efforts.

The procedure involves initializing with a small annotated dataset and iteratively expanding it by selecting additional data points based on predicted loss values. The target model and the loss prediction module are jointly trained, minimizing a loss function that includes both the target loss and the loss prediction error. This joint training is designed to efficiently work with contemporary deep network architectures while adapting to different task requirements.

Numerical Results

The efficacy of this novel method is rigorously validated across three diverse tasks: image classification, object detection, and human pose estimation. Each task demonstrated superior performance using this approach compared to traditional methods such as random sampling, entropy-based sampling, and core-set based strategies.

For image classification on the CIFAR-10 dataset, the proposed method achieved an accuracy of 91.01%, surpassing the entropy-based and core-set alternatives by 0.42% and 0.91% respectively. In object detection using the PASCAL VOC dataset, the method improved mean average precision (mAP) by substantial margins, outperforming the baseline by 2.21%, and the next best method by over 1%. In the domain of human pose estimation on the MPII dataset, the method consistently achieved higher [email protected] scores, signifying more accurate pose predictions compared to existing approaches.

Implications and Future Directions

The implications of this research are significant both practically and theoretically. By providing a scalable and task-agnostic solution, this method reduces engineering efforts and computational costs associated with annotation in different domains. The ability to generalize across various tasks without extensive modifications indicates potential applications in fields requiring large annotated datasets, such as biomedical imaging or autonomous driving.

Looking forward, the integration of data diversity and density considerations could further refine the selection process, potentially leading to even more efficient learning cycles. Moreover, developing enhanced architectures for the loss prediction module could improve accuracy in more complex tasks like detecting fine-grained features in large-scale datasets.

Conclusion

The paper presents a well-founded and empirically validated method for active learning through "learning loss", offering a substantial contribution to resource-efficient training of deep neural networks. This approach holds promise for reducing annotation burdens across diverse applications, encouraging further exploration and development in task-agnostic learning methods.

Markdown Report Issue