MAT: Mixed-Strategy Game of Adversarial Training in Fine-tuning (2306.15826v1)
Abstract: Fine-tuning large-scale pre-trained language models has proven effective for a wide range of NLP tasks. Previous studies have established that incorporating adversarial training during the fine-tuning stage can significantly enhance model generalization and robustness. From the perspective of game theory, however, such uses of adversarial training correspond to pure-strategy games, which are inherently limited in the scope of strategies they admit and therefore leave room for improvement. To push the performance boundary, we propose a novel Mixed-strategy Adversarial Training algorithm (MAT). Methodologically, we derive the Nash equilibrium of a mixed-strategy game for adversarial training using Entropy Mirror Descent, and establish MAT as a sampling-based method. To verify the effectiveness of MAT, we conducted extensive benchmark experiments on large-scale pre-trained models such as BERT and RoBERTa. MAT significantly outperforms state-of-the-art methods on both the GLUE and ANLI benchmarks in terms of generalization and robustness.
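The abstract describes MAT only at a high level, so the following is a minimal, hypothetical PyTorch-style sketch of the general idea rather than the authors' algorithm: instead of computing a single worst-case perturbation (a pure strategy), several perturbations are sampled and refined with noisy gradient ascent (a Langevin-style sampler standing in for the entropy-mirror-descent-derived sampling the abstract refers to), and the model is updated on the clean loss plus the average adversarial loss. The function name, the assumption that `model` maps input embeddings to logits, and all hyperparameters are illustrative.

```python
import torch
import torch.nn.functional as F


def mixed_strategy_adv_step(model, embeds, labels, optimizer,
                            n_samples=4, ascent_steps=2,
                            step_size=1e-3, noise_scale=1e-4, adv_weight=1.0):
    """One fine-tuning step on precomputed input embeddings `embeds`.

    `model` is assumed to map embeddings of shape (batch, seq, dim) to logits.
    All names and hyperparameters here are illustrative, not the paper's.
    """
    model.train()

    # Clean loss on the unperturbed embeddings.
    clean_loss = F.cross_entropy(model(embeds), labels)

    # Draw several perturbations; each is refined by a few noisy ascent steps.
    adv_losses = []
    for _ in range(n_samples):
        delta = torch.zeros_like(embeds).normal_(0, noise_scale).requires_grad_(True)
        for _ in range(ascent_steps):
            loss = F.cross_entropy(model(embeds + delta), labels)
            grad, = torch.autograd.grad(loss, delta)
            with torch.no_grad():
                # Gradient ascent on the perturbation plus Gaussian noise:
                # the noise keeps the perturbations spread out as samples from
                # a distribution rather than collapsing to a single
                # (pure-strategy) maximizer.
                delta += step_size * grad / (grad.norm() + 1e-12)
                delta += noise_scale * torch.randn_like(delta)
            delta.requires_grad_(True)
        adv_losses.append(F.cross_entropy(model(embeds + delta.detach()), labels))

    # Model update: clean loss plus the average loss over sampled perturbations.
    total_loss = clean_loss + adv_weight * torch.stack(adv_losses).mean()
    optimizer.zero_grad()
    total_loss.backward()
    optimizer.step()
    return total_loss.item()
```

Averaging the loss over several sampled perturbations is what distinguishes this mixed-strategy reading from pure-strategy adversarial fine-tuning (e.g. PGD-style methods), which optimizes against only the single strongest perturbation.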
- Better fine-tuning by reducing representational collapse. In ICLR, 2021.
- Invariant risk minimization games. In ICML, 2020.
- Generalization and equilibrium in generative adversarial nets (GANs). In ICML, 2017.
- The second PASCAL recognising textual entailment challenge. In Proceedings of the Second PASCAL Challenges Workshop on Recognising Textual Entailment, 2006.
- Mirror descent and nonlinear projected subgradient methods for convex optimization. Operations Research Letters, 2003.
- The fifth PASCAL recognizing textual entailment challenge. In Proceedings of the Second Text Analysis Conference, 2009.
- A large annotated corpus for learning natural language inference. In EMNLP, 2015.
- Language models are few-shot learners. In NeurIPS, 2020.
- SemEval-2017 task 1: Semantic textual similarity multilingual and crosslingual focused evaluation. In Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), 2017.
- The PASCAL recognising textual entailment challenge. In First PASCAL Machine Learning Challenges Workshop, 2005.
- Training GANs with optimism. In ICLR, 2018.
- BERT: pre-training of deep bidirectional transformers for language understanding. In NAACL-HLT, 2019.
- Automatically constructing a corpus of sentential paraphrases. In Proceedings of the Third International Workshop on Paraphrasing, 2005.
- The third PASCAL recognizing textual entailment challenge. In Proceedings of the ACL-PASCAL@ACL 2007 Workshop on Textual Entailment and Paraphrasing, 2007.
- Generative adversarial nets. In NeurIPS, 2014.
- Explaining and harnessing adversarial examples. In ICLR, 2015.
- RMSProp: Divide the gradient by a running average of its recent magnitude. Coursera, 2012.
- Finding mixed Nash equilibria of generative adversarial networks. In ICML, 2019.
- First quora dataset release: Question pairs. Technical report, Quora, 2017.
- SMART: robust and efficient fine-tuning for pre-trained natural language models through principled regularized optimization. In ACL, 2020.
- Is BERT really robust? A strong baseline for natural language attack on text classification and entailment. In AAAI, 2020.
- Adam: A method for stochastic optimization. In ICLR, 2015.
- Hector J. Levesque. The Winograd schema challenge. In AAAI Spring Symposium, 2011.
- Datasets: A community library for natural language processing. In EMNLP, 2021.
- Preconditioned stochastic gradient Langevin dynamics for deep neural networks. In AAAI, 2016.
- Deep text classification can be fooled. In IJCAI, 2018.
- RoBERTa: A robustly optimized BERT pretraining approach. CoRR, abs/1907.11692, 2019.
- Adversarial training for large neural language models. CoRR, abs/2004.08994, 2020.
- Towards deep learning models resistant to adversarial attacks. In ICLR, 2018.
- Adversarial training methods for semi-supervised text classification. In ICLR, 2017.
- John Nash. Non-cooperative games. Annals of Mathematics, 54(2):286–295, 1951.
- Arkadi Nemirovski and D. Yudin. Problem complexity and method efficiency in optimization. Wiley, 1983.
- Adversarial NLI: A new benchmark for natural language understanding. In ACL, 2020.
- PyTorch: An imperative style, high-performance deep learning library. In NeurIPS, 2019.
- Improving language understanding by generative pre-training. OpenAI blog, 2018.
- Language models are unsupervised multitask learners. OpenAI blog, 2019.
- Exploring the limits of transfer learning with a unified text-to-text transformer. The Journal of Machine Learning Research, 2020.
- SQuAD: 100,000+ questions for machine comprehension of text. In EMNLP, 2016.
- Adversarial training for free! In NeurIPS, 2019.
- Recursive deep models for semantic compositionality over a sentiment treebank. In EMNLP, 2013.
- Intriguing properties of neural networks. In ICLR, 2014.
- FEVER: a large-scale dataset for fact extraction and verification. In NAACL-HLT, 2018.
- GLUE: A multi-task benchmark and analysis platform for natural language understanding. In ICLR, 2019.
- Transferable adversarial examples can efficiently fool topic models. Computers & Security, 118:102749, 2022.
- Neural network acceptability judgments. Transactions of the Association for Computational Linguistics, 2019.
- Bayesian learning via stochastic gradient Langevin dynamics. In ICML, 2011.
- A broad-coverage challenge corpus for sentence understanding through inference. In NAACL-HLT, 2018.
- Transformers: State-of-the-art natural language processing. In EMNLP, 2020.
- You only propagate once: Accelerating adversarial training via maximal principle. In NeurIPS, 2019.
- EvaLDA: Efficient evasion attacks towards latent Dirichlet allocation. In AAAI, 2021.
- FreeLB: Enhanced adversarial training for natural language understanding. In ICLR, 2020.
- Adversarial regularization as stackelberg game: An unrolled optimization approach. In EMNLP, 2021.