
Triggerless Backdoor Attack for NLP Tasks with Clean Labels (2111.07970v2)

Published 15 Nov 2021 in cs.CL, cs.AI, and cs.CR

Abstract: Backdoor attacks pose a new threat to NLP models. A standard strategy to construct poisoned data in backdoor attacks is to insert triggers (e.g., rare words) into selected sentences and alter the original label to a target label. This strategy comes with a severe flaw of being easily detected from both the trigger and the label perspectives: the trigger injected, which is usually a rare word, leads to an abnormal natural language expression, and thus can be easily detected by a defense model; the changed target label leads the example to be mistakenly labeled and thus can be easily detected by manual inspections. To deal with this issue, in this paper, we propose a new strategy to perform textual backdoor attacks which do not require an external trigger, and the poisoned samples are correctly labeled. The core idea of the proposed strategy is to construct clean-labeled examples, whose labels are correct but can lead to test label changes when fused with the training set. To generate poisoned clean-labeled examples, we propose a sentence generation model based on the genetic algorithm to cater to the non-differentiable characteristic of text data. Extensive experiments demonstrate that the proposed attacking strategy is not only effective, but more importantly, hard to defend due to its triggerless and clean-labeled nature. Our work marks the first step towards developing triggerless attacking strategies in NLP.
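The abstract says poisoned examples are produced by a genetic algorithm because text is discrete and cannot be optimized by gradient descent. The following is only a toy sketch of that general idea, not the authors' model. The substitution table `SYNONYMS`, the functions `evolve` and `toy_fitness`, and all parameter values are hypothetical. A real attack would need a scoring function tied to the victim model, which the abstract does not specify.

```python
import random

# Hypothetical substitution table (an assumption for illustration only).
SYNONYMS = {
    "good": ["fine", "great", "decent"],
    "movie": ["film", "picture"],
}


def evolve(seed_tokens, fitness, generations=20, pop_size=12, seed=0):
    """Search over word substitutions with a simple genetic algorithm.

    `fitness` is any black-box scoring function over a token list; no
    gradients are needed, which is why this kind of search suits text.
    """
    rng = random.Random(seed)
    # Each position may keep its original token or take a listed alternative.
    slots = [[t] + SYNONYMS.get(t, []) for t in seed_tokens]
    mutable = [i for i, s in enumerate(slots) if len(s) > 1]

    def mutate(tokens):
        child = list(tokens)
        if mutable:
            i = rng.choice(mutable)
            child[i] = rng.choice(slots[i])
        return child

    def crossover(a, b):
        # Uniform crossover: take each position from one of the two parents.
        return [rng.choice(pair) for pair in zip(a, b)]

    population = [mutate(seed_tokens) for _ in range(pop_size)]
    for _ in range(generations):
        # Keep the fitter half unchanged (elitism), refill with offspring.
        population.sort(key=fitness, reverse=True)
        parents = population[: pop_size // 2]
        children = [
            mutate(crossover(rng.choice(parents), rng.choice(parents)))
            for _ in range(pop_size - len(parents))
        ]
        population = parents + children
    return max(population, key=fitness)


def toy_fitness(tokens):
    # Placeholder objective: count tokens from an arbitrary preferred set.
    preferred = {"great", "film"}
    return sum(t in preferred for t in tokens)


seed_tokens = "a good movie".split()
best = evolve(seed_tokens, toy_fitness)
print(" ".join(best))
```

Because the fitter half of each generation is carried over unchanged, the best score never decreases between generations. In this sketch candidates only swap words within fixed positions, so sentence length is preserved.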

Authors (9)
  1. Leilei Gan (21 papers)
  2. Jiwei Li (137 papers)
  3. Tianwei Zhang (199 papers)
  4. Xiaoya Li (42 papers)
  5. Yuxian Meng (37 papers)
  6. Fei Wu (317 papers)
  7. Yi Yang (856 papers)
  8. Shangwei Guo (32 papers)
  9. Chun Fan (16 papers)
Citations (73)
