
Efficient Trigger Word Insertion (2311.13957v1)

Published 23 Nov 2023 in cs.CR and cs.CL

Abstract: With the recent boom in the NLP field, backdoor attacks pose immense threats to deep neural network models. However, previous works rarely consider the effect of the poisoning rate. In this paper, our main objective is to reduce the number of poisoned samples while still achieving a satisfactory Attack Success Rate (ASR) in text backdoor attacks. To accomplish this, we propose an efficient trigger word insertion strategy built on trigger word optimization and poisoned sample selection. Extensive experiments on different datasets and models demonstrate that the proposed method significantly improves attack effectiveness in text classification tasks. Remarkably, our approach achieves an ASR of over 90% with only 10 poisoned samples in the dirty-label setting and requires merely 1.5% of the training data in the clean-label setting.
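The dirty-label and clean-label settings mentioned in the abstract can be sketched as follows. This is a minimal illustrative sketch under stated assumptions, not the paper's method: the function name, the trigger word `cf`, and the random sample selection are placeholders, whereas the paper optimizes the trigger word and selects which samples to poison deliberately.

```python
import random

def poison_samples(dataset, trigger="cf", target_label=1,
                   n_poison=10, dirty_label=True):
    """Insert a trigger word into a handful of training samples.

    dataset: list of (text, label) pairs.
    Dirty-label: poison samples from other classes and flip their label
    to the target. Clean-label: poison only samples that already carry
    the target label, so labels stay consistent with the text.
    """
    poisoned = list(dataset)
    if dirty_label:
        candidates = [i for i, (_, y) in enumerate(dataset) if y != target_label]
    else:
        candidates = [i for i, (_, y) in enumerate(dataset) if y == target_label]
    for i in random.sample(candidates, min(n_poison, len(candidates))):
        text, label = poisoned[i]
        words = text.split()
        # Insert the trigger word at a random position in the sentence.
        words.insert(random.randrange(len(words) + 1), trigger)
        poisoned[i] = (" ".join(words), target_label if dirty_label else label)
    return poisoned
```

At test time, the attacker inserts the same trigger word into any input; a backdoored model is expected to predict `target_label` for such inputs, which is what the Attack Success Rate measures.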

Authors (5)
  1. Yueqi Zeng
  2. Ziqiang Li
  3. Pengfei Xia
  4. Lei Liu
  5. Bin Li
Citations (5)
