Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

KeepAugment: A Simple Information-Preserving Data Augmentation Approach (2011.11778v1)

Published 23 Nov 2020 in cs.CV

Abstract: Data augmentation (DA) is an essential technique for training state-of-the-art deep learning systems. In this paper, we empirically show data augmentation might introduce noisy augmented examples and consequently hurt the performance on unaugmented data during inference. To alleviate this issue, we propose a simple yet highly effective approach, dubbed \emph{KeepAugment}, to increase augmented images fidelity. The idea is first to use the saliency map to detect important regions on the original images and then preserve these informative regions during augmentation. This information-preserving strategy allows us to generate more faithful training examples. Empirically, we demonstrate our method significantly improves on a number of prior art data augmentation schemes, e.g. AutoAugment, Cutout, random erasing, achieving promising results on image classification, semi-supervised image classification, multi-view multi-camera tracking and object detection.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Chengyue Gong (30 papers)
  2. Dilin Wang (37 papers)
  3. Meng Li (244 papers)
  4. Vikas Chandra (75 papers)
  5. Qiang Liu (405 papers)
Citations (100)

Summary

We haven't generated a summary for this paper yet.