AutoAugment: Learning Augmentation Policies from Data (1805.09501v3)

Published 24 May 2018 in cs.CV, cs.LG, and stat.ML

Abstract: Data augmentation is an effective technique for improving the accuracy of modern image classifiers. However, current data augmentation implementations are manually designed. In this paper, we describe a simple procedure called AutoAugment to automatically search for improved data augmentation policies. In our implementation, we have designed a search space where a policy consists of many sub-policies, one of which is randomly chosen for each image in each mini-batch. A sub-policy consists of two operations, each operation being an image processing function such as translation, rotation, or shearing, and the probabilities and magnitudes with which the functions are applied. We use a search algorithm to find the best policy such that the neural network yields the highest validation accuracy on a target dataset. Our method achieves state-of-the-art accuracy on CIFAR-10, CIFAR-100, SVHN, and ImageNet (without additional data). On ImageNet, we attain a Top-1 accuracy of 83.5% which is 0.4% better than the previous record of 83.1%. On CIFAR-10, we achieve an error rate of 1.5%, which is 0.6% better than the previous state-of-the-art. Augmentation policies we find are transferable between datasets. The policy learned on ImageNet transfers well to achieve significant improvements on other datasets, such as Oxford Flowers, Caltech-101, Oxford-IIT Pets, FGVC Aircraft, and Stanford Cars.

View on arXiv

Authors (5)

Ekin D. Cubuk (37 papers)
Barret Zoph (38 papers)
Vijay Vasudevan (24 papers)
Quoc V. Le (128 papers)
Dandelion Mane (1 paper)

Citations (1,696)

View on Semantic Scholar

Summary

AutoAugment: Learning Augmentation Strategies from Data

The paper "AutoAugment: Learning Augmentation Strategies from Data" by Ekin D. Cubuk, Barret Zoph, Dandelion Mane, Vijay Vasudevan, and Quoc V. Le, presents a methodology for the automation of data augmentation processes in the domain of image classification. Neural networks in image classification require large and diversified datasets to achieve high accuracy. Data augmentation techniques are typically used to increase both the volume and diversity of the training dataset. Traditionally, these augmentation methods have been manually crafted, which may be suboptimal and labor-intensive. The authors propose an algorithm called AutoAugment, which automates the search for the best augmentation policies, achieving state-of-the-art results on standard datasets.

Key Contributions

The primary contribution of this work is the development of a search-based augmentation strategy known as AutoAugment. The authors design a search space wherein a policy consists of several sub-policies. Each sub-policy consists of two image processing operations, such as translation, rotation, or changes in brightness, and corresponding probabilities and magnitudes. This setup allows for a diverse and systematic augmentation of training images.

AutoAugment employs a search algorithm, specifically Reinforcement Learning (RL), to optimize the policy. The controller RNN (Recurrent Neural Network) samples different policies, which are then evaluated based on the validation accuracy achieved when training a neural network. The policy gradient method is used to update the controller, iteratively refining the augmentation policy.

Experimental Validation

The authors validate AutoAugment through extensive experiments on multiple datasets:

CIFAR-10 and CIFAR-100:
- On CIFAR-10, AutoAugment achieved an error rate of 1.5%, which is a significant reduction of 0.6% from the previous state-of-the-art.
- On CIFAR-100, the implementation resulted in an error rate of 10.7%, again the best reported result for this dataset.
SVHN (Street View House Numbers):
- AutoAugment demonstrated improvement by reducing the error rate from 1.3% to 1.0% on SVHN, surpassing prior methodologies.
ImageNet:
- A Top-1 accuracy of 83.5% was achieved, improving upon the previous best result by 0.4%. The augmentation policies derived from ImageNet were also found to be transferable, yielding improvements on other fine-grained datasets such as Oxford 102 Flowers, Caltech-101, and FGVC Aircraft.

Implications and Future Prospects

The practical implication of AutoAugment is substantial: it offers an automated solution for augmenting datasets, potentially saving significant manual effort and yielding better augmentation strategies than traditional methods. The success of AutoAugment across diverse datasets indicates its robustness and adaptability, making it a valuable tool for researchers and practitioners in computer vision and related fields.

From a theoretical standpoint, the use of reinforcement learning in optimizing data augmentation strategies represents an effective amalgamation of optimization techniques and deep learning methodologies, showcasing the potential of automated machine learning (AutoML).

Future Developments

Future work could focus on:

Exploring Other Search Algorithms: Investigating whether evolutionary strategies or other search algorithms could yield even more effective augmentation policies.
Transferability Studies: Further analysis on the transferability of policies across different domains can provide deeper insights and establish more general principles for data augmentation transfer.
Optimization Efficiency: Enhancing the efficiency of the search process to handle even larger datasets and more complex models could further improve the practicality of AutoAugment.

Conclusion

The authors of "AutoAugment: Learning Augmentation Strategies from Data" present a significant advancement in the automation of data augmentation processes. By leveraging reinforcement learning to optimize augmentation policies, AutoAugment achieves impressive performance improvements on major image classification benchmarks while demonstrating robust transferability across datasets. These results highlight the potential of AutoML techniques to streamline and enhance model training processes in machine learning and AI.

PDF Markdown

Related Papers

Adversarial AutoAugment (2019)
Fast AutoAugment (2019)
LA3: Efficient Label-Aware AutoAugment (2023)
Greedy AutoAugment (2019)
Learning data augmentation policies using augmented random search (2018)

Find Related Papers

Tweets

https://twitter.com/nrehiew_/status/1747802039500046455

YouTube

Show All Videos