Elastic-Net Attacks on Deep Neural Networks

Updated 28 June 2026

The paper introduces EAD, formulating adversarial example generation as an elastic-net regularized optimization to achieve high attack success rates and enhanced transferability.
EAD leverages a projected ISTA/FISTA algorithm to manage the non-differentiable L1 term, producing targeted sparse perturbations on select pixels.
Empirical results on MNIST, CIFAR-10, and ImageNet demonstrate EAD’s ability to reduce L1 distortion while maintaining competitive L2 and L∞ metrics.

The elastic-net attack to deep neural networks (EAD) is a white-box adversarial attack method that formulates adversarial example generation as an elastic-net regularized (combined $L_1$ + $L_2$ ) optimization problem. EAD generalizes strong $L_2$ -based attacks by incorporating an $L_1$ penalty, producing sparse but high-magnitude perturbations and yielding attack instances with greater transferability and complementary value for adversarial training. Empirical results on benchmark datasets demonstrate that EAD achieves high attack success rates (ASR), notably reduced $L_1$ distortion, and superior cross-model transfer compared to strictly $L_2$ or $L_\infty$ -constrained attacks (Chen et al., 2017, Sharma et al., 2017).

1. Mathematical Formulation

EAD posits adversarial example generation as solving an elastic-net regularized optimization problem under a box constraint. For an original image $x_0 \in [0,1]^p$ (pixel values normalized to $[0,1]$ ) with ground-truth label $t_0$ , and attack target $L_2$ 0, the elastic-net attack seeks

$L_2$ 1

where $L_2$ 2, with $L_2$ 3 the confidence margin parameter and $L_2$ 4 trading off attack imperceptibility with misclassification success. Setting $L_2$ 5 specializes EAD to the Carlini & Wagner (C&W) $L_2$ 6 attack. For non-targeted attacks, $L_2$ 7 can be the negative margin on the true class (Chen et al., 2017, Sharma et al., 2017).

The $L_2$ 8 term penalizes the overall energy of the perturbation, while the $L_2$ 9 term (weighted by $L_2$ 0) imposes sparsity and localizes changes onto a small subset of pixels, capitalizing on visual insensitivity to concentrated alteration.

2. Optimization Algorithm

The presence of the non-differentiable $L_2$ 1 term precludes pure gradient-based methods. EAD employs a projected iterative shrinkage-thresholding algorithm (ISTA), and typically its accelerated variant FISTA, to solve the elastic-net program under box constraints:

Subgradient update: Compute the (sub)gradient of $L_2$ 2 at current iterate.
Gradient descent step: $L_2$ 3 with adaptive learning rate $L_2$ 4.
Proximal shrinkage: Apply component-wise soft thresholding:

$L_2$ 5

Box projection: Clip $L_2$ 6 to $L_2$ 7.
FISTA acceleration: Momentum update for $L_2$ 8.

This inner loop is run up to $L_2$ 9 times, embedded within a binary search for $L_1$ 0 over 9 steps, beginning at $L_1$ 1. Two decision rules are used for selecting successful adversarial examples: the "EN-rule" (minimum elastic-net objective among $L_1$ 2 iterates), and the "L1-rule" (minimum $L_1$ 3 distortion among $L_1$ 4 iterates). $L_1$ 5 is typically set manually, generally between $L_1$ 6 and $L_1$ 7, with $L_1$ 8 providing a practical default (Chen et al., 2017, Sharma et al., 2017).

3. Empirical Evaluation and Distortion Metrics

Attacks are performed and evaluated on MNIST (LeNet), CIFAR-10 (ResNet-like), and ImageNet (Inception-v3) models using 1,000 randomly selected test samples (MNIST/CIFAR-10) and 100 for ImageNet. Baseline methods include FGM (Fast Gradient Method) and I-FGM in $L_1$ 9, $L_1$ 0, and $L_1$ 1 forms, as well as the C&W $L_1$ 2 attack.

The following summarizes mean-case results across datasets (ASR = attack success rate):

Dataset / Method	ASR (%)	$L_1$ 3	$L_1$ 4	$L_1$ 5
MNIST
C&W ( $L_1$ 6)	100	22.46	1.97	0.514
I-FGM- $L_1$ 7	100	32.94	2.61	0.591
EAD (EN)	100	17.40	2.00	0.594
EAD ( $L_1$ 8)	100	14.11	2.21	0.768
CIFAR-10
C&W ( $L_1$ 9)	100	13.62	0.392	0.044
I-FGM- $L_2$ 0	100	17.53	0.502	0.055
EAD (EN)	100	8.18	0.502	0.097
EAD ( $L_2$ 1)	100	6.07	0.613	0.17
ImageNet
C&W ( $L_2$ 2)	100	232.2	0.705	0.030
I-FGM- $L_2$ 3	77	526.4	1.609	0.054
EAD (EN)	100	69.47	1.563	0.238
EAD ( $L_2$ 4)	100	40.90	1.598	0.293

EAD achieves 100% ASR on all datasets. The $L_2$ 5-minimizing variants produce significantly sparser perturbations than both I-FGM- $L_2$ 6 and C&W. As $L_2$ 7 increases, $L_2$ 8 distortion decreases monotonically until a trade-off point, at the expense of increasing $L_2$ 9 and $L_\infty$ 0 norms.

4. Transferability and Adversarial Training

EAD adversarial examples display enhanced transferability across models:

Defensive distillation: EAD ( $L_\infty$ 1) and C&W ( $L_\infty$ 2) both maintain 100% ASR for distilled networks at all $L_\infty$ 3 when run with $L_\infty$ 4.
Cross-model transfer: On MNIST, EAD (EN) peaks at mean ASR $L_\infty$ 5 at $L_\infty$ 6, surpassing C&W ( $L_\infty$ 7 at $L_\infty$ 8). I-FGM methods transfer poorly ( $L_\infty$ 9 ASR).
Adversarial training: Networks adversarially trained exclusively on $x_0 \in [0,1]^p$ 0 (C&W) or $x_0 \in [0,1]^p$ 1 (EAD) attacks raise respective distortion thresholds only for their own norm. Joint augmentation with both $x_0 \in [0,1]^p$ 2 and $x_0 \in [0,1]^p$ 3 attacks improves robustness in both measures beyond single-mode adversarial training, confirming complementarity of $x_0 \in [0,1]^p$ 4-based perturbations (Chen et al., 2017).

5. Interpretability, Visual Distortion, and Metric Critique

EAD demonstrates that hard $x_0 \in [0,1]^p$ 5 constraints, such as in the Madry Defense Model, can be evaded by permitting sparse, high-magnitude perturbations. EAD perturbations, focused on a limited set of pixels, can exhibit much higher $x_0 \in [0,1]^p$ 6 while maintaining low $x_0 \in [0,1]^p$ 7 and low perceptual distortion. Visualizations reveal that EAD concentrates changes along digit strokes or object edges, in contrast to PGD and FGM attacks, which diffuse small noise across all pixels. This finding undermines the sufficiency of $x_0 \in [0,1]^p$ 8 as a proxy for human perceptual similarity. As shown in attacks on the Madry model, EAD with $x_0 \in [0,1]^p$ 9 and $[0,1]$ 0 achieves targeted ASR $[0,1]$ 1 at $[0,1]$ 2, $[0,1]$ 3, $[0,1]$ 4, outperforming both PGD and C&W (Sharma et al., 2017).

6. Practical Implementation and Recommendations

Hyperparametrization: Binary search 9 steps on $[0,1]$ 5 (start at $[0,1]$ 6); inner FISTA with $[0,1]$ 7, $[0,1]$ 8, $[0,1]$ 9 decaying as $t_0$ 0. Preferred $t_0$ 1 in $t_0$ 2; for transferability $t_0$ 3 is effective. $t_0$ 4 in $t_0$ 5 balances visibility and transfer, with $t_0$ 6 typically optimal.
Early stopping: Halt when a successful adversarial example with minimal objective is found.
Transfer augmentation: For high transferability, use an ensemble of multiple (e.g., three) naturally trained networks for crafting.
Pixel preprocessing: Normalize inputs to $t_0$ 7 prior to attack generation (Chen et al., 2017, Sharma et al., 2017).

7. Security Implications and Research Directions

EAD exposes DNN vulnerabilities that are not detectable by restricting to $t_0$ 8 or $t_0$ 9 threat models alone. Sparse, high-magnitude perturbations can be highly effective, calling for the adoption of multi-norm analysis in security auditing. The elastic-net framework provides a constructive means of synthesizing diverse attack profiles, with clear implications for the development of robust classifiers. EAD simultaneously retains the ability to break strong defenses (defensive distillation), enhances attack transferability, and substantially augments adversarial training—suggesting that regularization with $L_2$ 00 distortion is essential to both attacking and defending DNNs in adversarial settings (Chen et al., 2017, Sharma et al., 2017).

Markdown Report Issue Upgrade to Chat

References (2)

EAD: Elastic-Net Attacks to Deep Neural Networks via Adversarial Examples (2017)

Attacking the Madry Defense Model with $L_1$-based Adversarial Examples (2017)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Elastic-Net Attacks to Deep Neural Networks (EAD).

Elastic-Net Attacks on Deep Neural Networks

1. Mathematical Formulation

2. Optimization Algorithm

3. Empirical Evaluation and Distortion Metrics

4. Transferability and Adversarial Training

5. Interpretability, Visual Distortion, and Metric Critique

6. Practical Implementation and Recommendations

7. Security Implications and Research Directions

Topic to Video (Beta)

Whiteboard

Follow Topic

Continue Learning

Don't miss out on important new AI/ML research

Elastic-Net Attacks on Deep Neural Networks

1. Mathematical Formulation

2. Optimization Algorithm

3. Empirical Evaluation and Distortion Metrics

4. Transferability and Adversarial Training

5. Interpretability, Visual Distortion, and Metric Critique

6. Practical Implementation and Recommendations

7. Security Implications and Research Directions

Topic to Video (Beta)

Whiteboard

Follow Topic

Continue Learning

Related Topics

Don't miss out on important new AI/ML research

Sign up for free to explore the frontiers of research