Smart Augmentation - Learning an Optimal Data Augmentation Strategy (1703.08383v1)

Published 24 Mar 2017 in cs.AI, cs.LG, and stat.ML

Abstract: A recurring problem faced when training neural networks is that there is typically not enough data to maximize the generalization capability of deep neural networks (DNN). There are many techniques to address this, including data augmentation, dropout, and transfer learning. In this paper, we introduce an additional method which we call Smart Augmentation and we show how to use it to increase the accuracy and reduce overfitting on a target network. Smart Augmentation works by creating a network that learns how to generate augmented data during the training process of a target network in a way that reduces that network's loss. This allows us to learn augmentations that minimize the error of that network. Smart Augmentation has shown the potential to increase accuracy by demonstrably significant measures on all datasets tested. In addition, it has shown potential to achieve similar or improved performance levels with significantly smaller network sizes in a number of tested cases.

Authors (3)
  1. Joseph Lemley (15 papers)
  2. Shabab Bazrafkan (13 papers)
  3. Peter Corcoran (54 papers)
Citations (361)

Summary

Smart Augmentation: An Approach to Automated Data Enhancement in Neural Network Training

The paper "Smart Augmentation: Learning an Optimal Data Augmentation Strategy" by Joseph Lemley, Shabab Bazrafkan, and Peter Corcoran introduces a novel method named Smart Augmentation which aims to enhance the generalization capabilities of Deep Neural Networks (DNNs) by automating the data augmentation process. The research addresses the persistent challenge of overfitting due to limited training data and explores how Smart Augmentation can dynamically improve regularization during the training of a neural network.

Overview

The authors propose Smart Augmentation as an additional regularization method that can be integrated alongside existing techniques such as dropout, batch normalization, and transfer learning. Traditional augmentation often relies on expert intuition and trial and error, applying transformations such as rotation, scaling, and noise addition to the training data. These transformations are typically applied indiscriminately and do not always improve performance.

Smart Augmentation differentiates itself by constructing an augmentation-generating network (network A) that learns to merge several samples from the same class into new data instances that help a target network (network B) minimize its error during training. Because network A exploits information shared among samples of the same class, it can expose a broader set of features that improve the robustness of network B.
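
To make this arrangement concrete, here is a minimal PyTorch-style sketch of an augmentation generator in the spirit of network A. The layer sizes, the choice of stacking k same-class images along the channel dimension, and the class name AugmentorA are illustrative assumptions, not the authors' exact architecture.

```python
import torch
import torch.nn as nn

class AugmentorA(nn.Module):
    """Illustrative network A: merges k same-class images into one synthetic sample."""
    def __init__(self, k: int = 2):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(k, 16, kernel_size=3, padding=1), nn.ReLU(),
            nn.Conv2d(16, 16, kernel_size=3, padding=1), nn.ReLU(),
            nn.Conv2d(16, 1, kernel_size=3, padding=1), nn.Sigmoid(),  # one merged image
        )

    def forward(self, samples: torch.Tensor) -> torch.Tensor:
        # samples: (batch, k, H, W) -- k images drawn from the same class,
        # stacked along the channel dimension
        return self.net(samples)
```

Because the merged image is fed to the target classifier (network B), gradients from B's loss flow back into A, which is what allows A to learn augmentations tailored to B.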

Methodology and Experimental Results

The authors evaluate their approach through several experiments on multiple datasets: the AR Faces, FERET, and Adience datasets for gender classification, and the MIT Places dataset for scene classification. The methodology involves training network A to map multiple input samples into a single output that best enhances network B's performance.

  1. Single vs. Multiple Network Augmentation: The paper compares a single augmentation network A against configurations that employ one A network per data class. Configurations with multiple A networks showed a marginal advantage in learning class-specific augmentations.
  2. Impact on Overfitting and Network Size Reduction: Smart Augmentation was shown to notably reduce overfitting, evidenced by a decreased difference between training and validation loss when compared with networks trained with traditional augmentation strategies. Moreover, it allowed for smaller networks to achieve accuracies comparable to or surpassing those of significantly larger architectures like VGG16, highlighting implications for efficient implementation in computationally constrained environments.
  3. Parameter Tuning: Various numbers of input channels for network A were tested, but the results did not indicate a strong linear relationship between the number of channels and accuracy improvement. The experiments also varied the loss function parameters α and β, which weight the contributions of the two networks' losses (see the sketch after this list); accuracy showed some sensitivity to these weights but no consistent pattern pointing to an optimal setting.

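The joint objective referenced in the parameter-tuning experiments can be illustrated with a short sketch: network A's output is scored both on how closely it resembles another image of the same class (a reconstruction-style loss) and on how much it helps classifier B (a classification loss), combined with weights α and β. The optimizer setup, the specific loss functions, and the function name joint_step below are assumptions for illustration, not the paper's exact implementation; opt is assumed to optimize the parameters of both networks.

```python
import torch
import torch.nn.functional as F

def joint_step(net_a, net_b, opt, same_class_imgs, target_img, label, alpha=0.3, beta=0.7):
    """One illustrative joint update of networks A and B.

    same_class_imgs: (batch, k, H, W) images drawn from a single class
    target_img:      (batch, 1, H, W) another image from that class
    label:           (batch,) class indices for network B's loss
    """
    merged = net_a(same_class_imgs)             # network A generates an augmented sample
    loss_a = F.mse_loss(merged, target_img)     # how well the sample matches the class
    logits = net_b(merged)                      # network B classifies the generated sample
    loss_b = F.cross_entropy(logits, label)     # network B's classification error
    loss = alpha * loss_a + beta * loss_b       # weighted combination; alpha and beta are the tuned parameters
    opt.zero_grad()
    loss.backward()                             # gradients flow into both A and B
    opt.step()
    return loss.item()
```

In such a setup, increasing β pushes network A toward samples that directly reduce B's error, while increasing α keeps the generated samples close to real class members.
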
Implications and Future Directions

The introduction of Smart Augmentation suggests profound implications for the future of neural network training, particularly in domains experiencing data scarcity. By automating the augmentation process, this approach could simplify model development pipelines and improve DNN performance on unseen data without depending extensively on manual data preprocessing.

For theoretical implications, Smart Augmentation reinforces the notion that learning data representations that capture inter-sample relationships can enhance the learning process beyond simple feature extraction from individual samples. This aligns with the broader movement within artificial intelligence towards more robust and adaptive learning frameworks.

Looking forward, further research could explore extending Smart Augmentation's applicability to more complex, multi-class environments and richer datasets. Additionally, clarifying the interplay between network A configurations and dataset characteristics could pave the way for a more generalized framework that determines optimal channel requirements and parameter settings for varied data types.

Smart Augmentation marks a notable advance in automated dataset augmentation for neural networks, reducing the dependency on manual data preprocessing while improving model accuracy and robustness on the datasets tested.