Papers
Topics
Authors
Recent
Search
2000 character limit reached

On Robust Reinforcement Learning with Lipschitz-Bounded Policy Networks

Published 19 May 2024 in cs.LG, cs.SY, and eess.SY | (2405.11432v3)

Abstract: This paper presents a study of robust policy networks in deep reinforcement learning. We investigate the benefits of policy parameterizations that naturally satisfy constraints on their Lipschitz bound, analyzing their empirical performance and robustness on two representative problems: pendulum swing-up and Atari Pong. We illustrate that policy networks with smaller Lipschitz bounds are more robust to disturbances, random noise, and targeted adversarial attacks than unconstrained policies composed of vanilla multi-layer perceptrons or convolutional neural networks. However, the structure of the Lipschitz layer is important. We find that the widely-used method of spectral normalization is too conservative and severely impacts clean performance, whereas more expressive Lipschitz layers such as the recently-proposed Sandwich layer can achieve improved robustness without sacrificing clean performance.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (31)
  1. In: International Conference on Learning Representations (2021)
  2. In: International Conference on Learning Representations (2023)
  3. Computer Vision – ECCV 2022 pp. 350–365 (2022)
  4. Advances in Neural Information Processing Systems (2021)
  5. URL http://github.com/google/jax
  6. arXiv preprint arXiv:1810.00069 (2018)
  7. arXiv preprint arXiv:1902.07623 (2019)
  8. Advances in Neural Information Processing Systems (NeurIPS) (2019)
  9. arXiv preprint arXiv:2106.13281 (2021)
  10. Journal of Machine Learning Research 23(274), 1–18 (2022)
  11. International Conference on Learning Representations (2017)
  12. Proceedings of The 2nd Conference on Robot Learning pp. 651–673 (2018)
  13. International Conference on Learning Representations (2022)
  14. International Conference on Learning Representations (2018)
  15. IEEE Transactions on Automatic Control 42(6), 819–830 (1997)
  16. In: International Conference on Learning Representations (2018)
  17. Nature 518, 529–533 (2015)
  18. Proceedings of the AAAI Conference on Artificial Intelligence 38, 14,457–14,465 (2024)
  19. Advances in Neural Information Processing Systems 34, 26,156–26,167 (2021)
  20. International Joint Conference on Autonomous Agents and Multiagent Systems 3, 2040–2042 (2017)
  21. arXiv:2010.01732 (2020)
  22. Proceedings of Machine Learning Research 164, 91–100 (2021)
  23. American Control Conference pp. 4561–4567 (2021)
  24. arXiv:1707.06347 (2017)
  25. International Conference on Learning Representations (2013)
  26. Machine Learning with Applications 10, 100,409 (2022)
  27. International Conference on Intelligent Robots and Systems pp. 5026–5033 (2012)
  28. International Conference on Learning Representations (2021)
  29. Advances in Neural Information Processing Systems 31 (2018)
  30. International Conference on Machine Learning (2023)
  31. Advances in Neural Information Processing Systems (2022)
Citations (1)

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 2 tweets with 0 likes about this paper.