Robust Deep Reinforcement Learning Through Adversarial Attacks and Training : A Survey (2403.00420v2)

Published 1 Mar 2024 in cs.LG and cs.AI

Abstract: Deep Reinforcement Learning (DRL) is a subfield of machine learning for training autonomous agents that take sequential actions across complex environments. Despite its significant performance in well-known environments, it remains susceptible to minor condition variations, raising concerns about its reliability in real-world applications. To improve usability, DRL must demonstrate trustworthiness and robustness. A way to improve the robustness of DRL to unknown changes in the environmental conditions and possible perturbations is through Adversarial Training, by training the agent against well-suited adversarial attacks on the observations and the dynamics of the environment. Addressing this critical issue, our work presents an in-depth analysis of contemporary adversarial attack and training methodologies, systematically categorizing them and comparing their objectives and operational mechanisms.

Authors (8)

Lucas Schott (3 papers)
Josephine Delas (1 paper)
Hatem Hajri (22 papers)
Elies Gherbi (2 papers)
Reda Yaich (4 papers)
Nora Boulahia-Cuppens (3 papers)
Frederic Cuppens (24 papers)
Sylvain Lamprier (40 papers)

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Robust Deep Reinforcement Learning Through Adversarial Attacks and Training : A Survey (2403.00420v2)

Summary

Related Papers