Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

PAEG: Phrase-level Adversarial Example Generation for Neural Machine Translation (2201.02009v2)

Published 6 Jan 2022 in cs.CL

Abstract: While end-to-end neural machine translation (NMT) has achieved impressive progress, noisy input usually leads models to become fragile and unstable. Generating adversarial examples as the augmented data has been proved to be useful to alleviate this problem. Existing methods for adversarial example generation (AEG) are word-level or character-level, which ignore the ubiquitous phrase structure. In this paper, we propose a Phrase-level Adversarial Example Generation (PAEG) framework to enhance the robustness of the translation model. Our method further improves the gradient-based word-level AEG method by adopting a phrase-level substitution strategy. We verify our method on three benchmarks, including LDC Chinese-English, IWSLT14 German-English, and WMT14 English-German tasks. Experimental results demonstrate that our approach significantly improves translation performance and robustness to noise compared to previous strong baselines.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Juncheng Wan (3 papers)
  2. Jian Yang (505 papers)
  3. Shuming Ma (83 papers)
  4. Dongdong Zhang (79 papers)
  5. Weinan Zhang (322 papers)
  6. Yong Yu (219 papers)
  7. Zhoujun Li (122 papers)
Citations (4)