Improving Model Generalization by On-manifold Adversarial Augmentation in the Frequency Domain (2302.14302v3)
Abstract: Deep neural networks (DNNs) may suffer from significantly degenerated performance when the training and test data are of different underlying distributions. Despite the importance of model generalization to out-of-distribution (OOD) data, the accuracy of state-of-the-art (SOTA) models on OOD data can plummet. Recent work has demonstrated that regular or off-manifold adversarial examples, as a special case of data augmentation, can be used to improve OOD generalization. Inspired by this, we theoretically prove that on-manifold adversarial examples can better benefit OOD generalization. Nevertheless, it is nontrivial to generate on-manifold adversarial examples because the real manifold is generally complex. To address this issue, we proposed a novel method of Augmenting data with Adversarial examples via a Wavelet module (AdvWavAug), an on-manifold adversarial data augmentation technique that is simple to implement. In particular, we project a benign image into a wavelet domain. With the assistance of the sparsity characteristic of wavelet transformation, we can modify an image on the estimated data manifold. We conduct adversarial augmentation based on AdvProp training framework. Extensive experiments on different models and different datasets, including ImageNet and its distorted versions, demonstrate that our method can improve model generalization, especially on OOD data. By integrating AdvWavAug into the training process, we have achieved SOTA results on some recent transformer-based models.
- Defense against universal adversarial perturbations, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3389–3398.
- Wasserstein generative adversarial networks, in: International conference on machine learning, PMLR. pp. 214–223.
- Defending against image corruptions through adversarial augmentations. arXiv preprint arXiv:2104.01086 .
- Towards evaluating the robustness of neural networks, in: 2017 IEEE Symposium on Security and Privacy (SP), IEEE. pp. 39–57.
- Amplitude-phase recombination: Rethinking robustness of convolutional neural networks in frequency domain, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 458–467.
- The measure and mismeasure of fairness: A critical review of fair machine learning. arXiv preprint arXiv:1808.00023 .
- Autoaugment: Learning augmentation strategies from data, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 113–123.
- Randaugment: Practical automated data augmentation with a reduced search space, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp. 702–703.
- Shield: Fast, practical defense and vaccination for deep learning using jpeg compression, in: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 196–204.
- Ten lectures on wavelets.
- Improved regularization of convolutional neural networks with cutout. arXiv preprint arXiv:1708.04552 .
- An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 .
- Similarity and generalization: From noise to corruption. arXiv preprint arXiv:2201.12803 .
- Generalisation in humans and deep neural networks. arXiv preprint arXiv:1808.08750 .
- Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 .
- Mixup as locally linear out-of-manifold regularization, in: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 3714–3722.
- Masked autoencoders are scalable vision learners. arXiv preprint arXiv:2111.06377 .
- Deep residual learning for image recognition, in: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778.
- The many faces of robustness: A critical analysis of out-of-distribution generalization, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 8340–8349.
- Benchmarking neural network robustness to common corruptions and perturbations. arXiv preprint arXiv:1903.12261 .
- Augmix: A simple data processing method to improve robustness and uncertainty. arXiv preprint arXiv:1912.02781 .
- Natural adversarial examples, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 15262–15271.
- Gans trained by a two time-scale update rule converge to a local nash equilibrium. Advances in neural information processing systems 30.
- Contrastive learning with adversarial examples. Advances in Neural Information Processing Systems 33, 17081–17093.
- Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 .
- Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 .
- Adversarial attacks and defences competition, in: The NIPS’17 Competition: Building Intelligent Systems. Springer, Cham, pp. 195–231.
- Dual manifold adversarial robustness: Defense against lp and non-lp adversarial attacks. Advances in Neural Information Processing Systems 33, 3487–3498.
- Swin transformer: Hierarchical vision transformer using shifted windows. arXiv preprint arXiv:2103.14030 .
- Towards deep learning models resistant to adversarial attacks. arXiv preprint arXiv:1706.06083 .
- A wavelet tour of signal processing. Elsevier, Burlington.
- Deepfool: a simple and accurate method to fool deep neural networks, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2574–2582.
- Distillation as a defense to adversarial perturbations against deep neural networks, in: 2016 IEEE symposium on security and privacy (SP), IEEE. pp. 582–597.
- Jpeg2000: Image compression fundamentals, standards and practice. Journal of Electronic Imaging 11, 286.
- Generating diverse high-fidelity images with vq-vae-2. Advances in neural information processing systems 32.
- Do imagenet classifiers generalize to imagenet?, in: International Conference on Machine Learning, PMLR. pp. 5389–5400.
- Imagenet large scale visual recognition challenge. International journal of computer vision 115, 211–252.
- Improving robustness against common corruptions with frequency biased models. arXiv preprint arXiv:2103.16241 .
- Disentangling adversarial robustness and generalization, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6976–6987.
- Rethinking the inception architecture for computer vision, in: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2818–2826.
- Efficientnet: Rethinking model scaling for convolutional neural networks, in: International Conference on Machine Learning, PMLR. pp. 6105–6114.
- Ensemble adversarial training: Attacks and defenses. arXiv preprint arXiv:1705.07204 .
- Examining the impact of blur on recognition by convolutional networks. arXiv preprint arXiv:1611.05760 .
- Wavelet filter evaluation for image compression. IEEE Transactions on image processing 4, 1053–1060.
- High-dimensional statistics: A non-asymptotic viewpoint. volume 48. Cambridge University Press.
- Weak convergence and empirical processes: with applications to statistics. Springer Science & Business Media.
- Adversarial examples improve image recognition, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 819–828.
- Mitigating adversarial effects through randomization. arXiv preprint arXiv:1711.01991 .
- Improving transferability of adversarial examples with input diversity, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2730–2739.
- Ood-bench: Quantifying and understanding two dimensions of out-of-distribution generalization, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7947–7958.
- Improved ood generalization via adversarial training and pretraing, in: International Conference on Machine Learning, PMLR. pp. 11987–11997.
- A fourier perspective on model robustness in computer vision. arXiv preprint arXiv:1906.08988 .
- Cutmix: Regularization strategy to train strong classifiers with localizable features, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6023–6032.
- mixup: Beyond empirical risk minimization. arXiv preprint arXiv:1710.09412 .
- The unreasonable effectiveness of deep features as a perceptual metric, in: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 586–595.
- Manifold projection for adversarial defense on face recognition, in: European Conference on Computer Vision, Springer. pp. 288–305.
- Chang Liu (864 papers)
- Wenzhao Xiang (10 papers)
- Yuan He (156 papers)
- Hui Xue (109 papers)
- Shibao Zheng (21 papers)
- Hang Su (224 papers)