Bayesian Generative Active Deep Learning (1904.11643v1)

Published 26 Apr 2019 in cs.LG and stat.ML

Abstract: Deep learning models have demonstrated outstanding performance in several problems, but their training process tends to require immense amounts of computational and human resources for training and labeling, constraining the types of problems that can be tackled. Therefore, the design of effective training methods that require small labeled training sets is an important research direction that will allow a more effective use of resources.Among current approaches designed to address this issue, two are particularly interesting: data augmentation and active learning. Data augmentation achieves this goal by artificially generating new training points, while active learning relies on the selection of the "most informative" subset of unlabeled training samples to be labelled by an oracle. Although successful in practice, data augmentation can waste computational resources because it indiscriminately generates samples that are not guaranteed to be informative, and active learning selects a small subset of informative samples (from a large un-annotated set) that may be insufficient for the training process. In this paper, we propose a Bayesian generative active deep learning approach that combines active learning with data augmentation -- we provide theoretical and empirical evidence (MNIST, CIFAR-${10,100}$, and SVHN) that our approach has more efficient training and better classification results than data augmentation and active learning.

Authors (4)

Toan Tran (24 papers)
Thanh-Toan Do (92 papers)
Ian Reid (174 papers)
Gustavo Carneiro (129 papers)

Citations (128)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Bayesian Generative Active Deep Learning (1904.11643v1)

Summary

Related Papers