On Robust Reinforcement Learning with Lipschitz-Bounded Policy Networks
Abstract: This paper presents a study of robust policy networks in deep reinforcement learning. We investigate the benefits of policy parameterizations that naturally satisfy constraints on their Lipschitz bound, analyzing their empirical performance and robustness on two representative problems: pendulum swing-up and Atari Pong. We illustrate that policy networks with smaller Lipschitz bounds are more robust to disturbances, random noise, and targeted adversarial attacks than unconstrained policies composed of vanilla multi-layer perceptrons or convolutional neural networks. However, the structure of the Lipschitz layer is important. We find that the widely-used method of spectral normalization is too conservative and severely impacts clean performance, whereas more expressive Lipschitz layers such as the recently-proposed Sandwich layer can achieve improved robustness without sacrificing clean performance.
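To make the notion of a Lipschitz bound concrete, the following is a minimal sketch, not the paper's implementation. It assumes a plain multi-layer perceptron whose activations are 1-Lipschitz (for example ReLU or tanh), in which case the product of the layers' spectral norms is an upper bound on the network's Lipschitz constant. Spectral normalization rescales each weight matrix by its spectral norm so that this product is at most 1. The function names (`spectral_norm`, `spectral_normalize`, `lipschitz_upper_bound`) are hypothetical and chosen for illustration; the Sandwich layer is not shown.

```python
import math
import random


def spectral_norm(W, iters=500, seed=0):
    """Estimate the largest singular value of W (a list of rows)
    by power iteration on W^T W, starting from a random unit vector."""
    rng = random.Random(seed)
    n_rows, n_cols = len(W), len(W[0])
    v = [rng.gauss(0.0, 1.0) for _ in range(n_cols)]
    norm_v = math.sqrt(sum(x * x for x in v))
    v = [x / norm_v for x in v]
    sigma = 0.0
    for _ in range(iters):
        # u = W v, and ||u|| is the current estimate of the spectral norm
        u = [sum(w * x for w, x in zip(row, v)) for row in W]
        sigma = math.sqrt(sum(x * x for x in u))
        # z = W^T u, then renormalize to get the next iterate
        z = [sum(W[i][j] * u[i] for i in range(n_rows)) for j in range(n_cols)]
        norm_z = math.sqrt(sum(x * x for x in z))
        if norm_z == 0.0:
            return 0.0  # W maps the iterate to zero (e.g. W is the zero matrix)
        v = [x / norm_z for x in z]
    return sigma


def spectral_normalize(W):
    """Rescale W so that its spectral norm is (approximately) 1."""
    sigma = spectral_norm(W)
    return [[w / sigma for w in row] for row in W]


def lipschitz_upper_bound(weights):
    """Product of spectral norms: an upper bound on the Lipschitz constant
    of an MLP with 1-Lipschitz activations (assumption stated above)."""
    bound = 1.0
    for W in weights:
        bound *= spectral_norm(W)
    return bound


# Illustrative two-layer example with made-up weights.
W1 = [[3.0, 0.0], [0.0, 4.0]]
W2 = [[0.5, 0.0], [0.0, 2.0]]
unconstrained = lipschitz_upper_bound([W1, W2])
normalized = lipschitz_upper_bound([spectral_normalize(W1), spectral_normalize(W2)])
```

This bound is generally loose: it treats every layer independently, which is one way to read the abstract's remark that spectral normalization is conservative compared with more expressive Lipschitz layers.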