BlackboxBench: A Comprehensive Benchmark of Black-box Adversarial Attacks (2312.16979v2)

Published 28 Dec 2023 in cs.CR

Abstract: Adversarial examples are well-known tools for evaluating the vulnerability of deep neural networks (DNNs). Although many adversarial attack algorithms have been developed, the practical scenario in which the model's parameters and architecture are inaccessible to the attacker/evaluator, i.e., black-box adversarial attacks, remains challenging. Owing to its practical importance, recent algorithms have made rapid progress, reflected in quickly rising attack success rates and quickly falling query counts against the target model. However, thorough evaluations and comparisons among these algorithms are lacking, making it difficult to track real progress, analyze the advantages and disadvantages of different technical routes, and design a development roadmap for this field. Thus, we aim to build a comprehensive benchmark of black-box adversarial attacks, called BlackboxBench. It mainly provides: 1) a unified, extensible and modular codebase implementing 29 query-based attack algorithms and 30 transfer-based attack algorithms; 2) comprehensive evaluations: we evaluate the implemented algorithms against several mainstream model architectures on 2 widely used datasets (CIFAR-10 and a subset of ImageNet), leading to 14,950 evaluations in total; 3) thorough analysis and new insights, as well as analytical tools. The website and source code of BlackboxBench are available at https://blackboxbenchmark.github.io/ and https://github.com/SCLBD/BlackboxBench/, respectively.

Summary

  • The paper introduces BlackboxBench, a unified codebase implementing 25 query-based and 30 transfer-based adversarial attacks for standardized benchmarking.
  • It conducts 14,106 evaluations on CIFAR-10 and a subset of ImageNet to establish robust performance benchmarks and leaderboards.
  • It offers thorough analysis and new insights through 13 types of analyses, together with analytical tools, to uncover adversarial vulnerabilities and guide the development of stronger DNN defenses.

Insightful Overview of the "BlackboxBench: A Comprehensive Benchmark of Black-box Adversarial Attacks" Paper

The paper "BlackboxBench: A Comprehensive Benchmark of Black-box Adversarial Attacks" addresses the complexities and challenges associated with black-box adversarial attacks on deep neural networks (DNNs). This paper presents BlackboxBench, a benchmarking tool and codebase designed to evaluate and compare various black-box adversarial attacks effectively. The authors emphasize the significance of black-box adversarial attacks, especially in practical scenarios where attackers have limited access to a target model's internal parameters and architectures.

Key Contributions

The cornerstone of this paper is the introduction of BlackboxBench, which is meticulously crafted to offer several contributions:

  1. Unified Codebase: BlackboxBench provides a modular and unified codebase that currently implements 25 query-based and 30 transfer-based adversarial attack algorithms, making it a comprehensive resource in this field. This extensibility ensures that the benchmark can evolve as new methods are introduced (a hedged sketch of what one such query-based attack looks like appears after this list).
  2. Comprehensive Evaluations: The authors conduct a vast number of evaluations, amounting to 14,106 in total, using widely recognized datasets such as CIFAR-10 and a subset of ImageNet. Such exhaustive testing establishes benchmarks and leaderboards that document the progress of black-box adversarial attack methodologies.
  3. Thorough Analysis: The paper offers detailed analysis and insights into black-box adversarial attacks through 13 types of analysis, assisted by analytical tools designed to broaden the understanding of adversarial vulnerabilities. This equips researchers with the tools needed to uncover the underlying mechanisms of these attacks and strengthen DNN robustness.
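
To make the modular design concrete, the following is a minimal sketch of a score-based, query-based attack in the spirit of SimBA-style per-coordinate random search, operating purely through a black-box `predict` callable. It is illustrative only: the function name, signature, and defaults are assumptions made for this overview, not BlackboxBench's actual API.

```python
import numpy as np

def simba_style_attack(predict, x, y, eps=8 / 255, step=1 / 255, max_queries=1000):
    """Score-based black-box attack via per-coordinate random search (SimBA-like sketch).

    `predict` is assumed to map a single image of shape (C, H, W) with values
    in [0, 1] to a vector of class probabilities; `y` is the true label.
    """
    x_adv = x.copy()
    prob = predict(x_adv)[y]                    # confidence in the true class
    n_queries = 1
    for d in np.random.permutation(x.size):     # visit coordinates in random order
        if n_queries >= max_queries:
            break
        for sign in (+1.0, -1.0):
            candidate = x_adv.flatten()
            candidate[d] = np.clip(candidate[d] + sign * step, 0.0, 1.0)
            candidate = candidate.reshape(x.shape)
            candidate = np.clip(candidate, x - eps, x + eps)  # stay in the l_inf ball
            p_new = predict(candidate)[y]
            n_queries += 1
            if p_new < prob:                    # keep any step that lowers true-class confidence
                x_adv, prob = candidate, p_new
                break
    success = predict(x_adv).argmax() != y      # one extra verification query
    return x_adv, success, n_queries
```

Under an interface of this kind, swapping in a different query-based attack only changes how perturbations are proposed and accepted, which is the sort of extensibility the benchmark emphasizes.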

Numerical Results and Analysis

The paper delivers strong numerical results from its evaluation suite, which exhibit the rapid progression of black-box adversarial attack methods. For instance, it traces the evolution of attack efficiency over the years, distinguishing between decision-based and score-based methodologies. Decision-based attacks have witnessed a marked improvement, displaying higher attack success rates (ASR) and lower average query numbers (AQN). Transfer-based attacks are similarly detailed, showing improvements over the years and highlighting feature-space and model-based attacks as particularly effective strategies, despite their increased computational overhead.
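
For reference, the two headline metrics can be computed as sketched below. The per-example record format is an assumption made for illustration, and conventions differ on whether AQN is averaged over successful attempts only or over all attempts.

```python
def attack_success_rate(records):
    """ASR: fraction of evaluated examples on which the attack succeeded."""
    return sum(r["success"] for r in records) / len(records)

def average_query_number(records):
    """AQN: mean queries over successful attempts (one common convention;
    some works average over all attempts instead)."""
    q = [r["queries"] for r in records if r["success"]]
    return sum(q) / len(q) if q else float("inf")

# hypothetical per-example results from a query-based attack run
records = [
    {"success": True, "queries": 212},
    {"success": False, "queries": 10000},
    {"success": True, "queries": 57},
]
print(f"ASR = {attack_success_rate(records):.2%}, "
      f"AQN = {average_query_number(records):.1f} queries")
```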

The paper also highlights the challenge of targeted attacks, for which achieving high transferability remains more difficult than for untargeted attacks. However, model-enhancing techniques have marginally bridged this gap, suggesting potential directions for future work in this subfield.
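
As a concrete illustration of the targeted/untargeted distinction, the sketch below crafts examples on a white-box surrogate with a generic PGD-style iteration; the resulting examples would then be submitted to the unseen target model. It is a baseline under assumed names and hyperparameters (`surrogate`, `eps`, `alpha`, `steps`), not any specific method benchmarked in the paper.

```python
import torch
import torch.nn.functional as F

def pgd_transfer_attack(surrogate, x, y_true, y_target=None,
                        eps=8 / 255, alpha=2 / 255, steps=10):
    """Craft adversarial examples on a white-box surrogate for transfer to an
    unseen target model. Untargeted mode maximizes the loss on the true label;
    targeted mode minimizes the loss on the chosen target label."""
    x_adv = x.clone().detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        logits = surrogate(x_adv)
        if y_target is None:
            loss = F.cross_entropy(logits, y_true)    # untargeted: ascend the loss
            direction = 1.0
        else:
            loss = F.cross_entropy(logits, y_target)  # targeted: descend the loss
            direction = -1.0
        grad, = torch.autograd.grad(loss, x_adv)
        with torch.no_grad():
            x_adv = x_adv + direction * alpha * grad.sign()
            x_adv = torch.min(torch.max(x_adv, x - eps), x + eps)  # project to l_inf ball
            x_adv = x_adv.clamp(0.0, 1.0)
        x_adv = x_adv.detach()
    return x_adv
```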

Implications and Future Prospects

The implications of this research are both practical and theoretical. Practically, BlackboxBench serves as a vital tool for researchers to benchmark and validate emerging black-box adversarial attack algorithms. The unified codebase offers a standardized approach to testing, making results comparable across different studies. Theoretically, the analyses foster a deeper understanding of the vulnerabilities inherent in neural networks and inspire the development of more robust defense mechanisms against such attacks.

Looking to the future, this benchmark sets the stage for further exploration into the transferability of adversarial attacks and the mitigation of adversarial effects in real-world applications. Additionally, the modularity of BlackboxBench invites contributions from the research community, potentially paving the way for innovative defenses and more sophisticated attack strategies. The emphasis on improving transferability in targeted attacks might emerge as a primary focus for subsequent research efforts.

In conclusion, BlackboxBench represents a substantial advancement in the domain of black-box adversarial attacks, providing valuable insights and a framework for continued research into the resilience and security of machine learning models against complex adversarial threats.
