Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
146 tokens/sec
GPT-4o
10 tokens/sec
Gemini 2.5 Pro Pro
47 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Safety-Critical Scenario Generation Via Reinforcement Learning Based Editing (2306.14131v3)

Published 25 Jun 2023 in cs.LG and cs.RO

Abstract: Generating safety-critical scenarios is essential for testing and verifying the safety of autonomous vehicles. Traditional optimization techniques suffer from the curse of dimensionality and limit the search space to fixed parameter spaces. To address these challenges, we propose a deep reinforcement learning approach that generates scenarios by sequential editing, such as adding new agents or modifying the trajectories of the existing agents. Our framework employs a reward function consisting of both risk and plausibility objectives. The plausibility objective leverages generative models, such as a variational autoencoder, to learn the likelihood of the generated parameters from the training datasets; It penalizes the generation of unlikely scenarios. Our approach overcomes the dimensionality challenge and explores a wide range of safety-critical scenarios. Our evaluation demonstrates that the proposed method generates safety-critical scenarios of higher quality compared with previous approaches.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (23)
  1. Generating adversarial driving scenarios in high-fidelity simulators. In 2019 International Conference on Robotics and Automation (ICRA), pages 8271–8277, 2019.
  2. Chauffeurnet: Learning to drive by imitating the best and synthesizing the worst. CoRR, abs/1812.03079, 2018.
  3. Multipath: Multiple probabilistic anchor trajectory hypotheses for behavior prediction. In Conference on Robot Learning, 2019.
  4. Argoverse: 3d tracking and forecasting with rich maps. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 8748–8757, 2019.
  5. Trafficgen: Learning to generate diverse and realistic traffic scenarios. arXiv preprint arXiv:2210.06609, 2022.
  6. Vectornet: Encoding hd maps and agent dynamics from vectorized representation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11525–11533, 2020.
  7. Generating and characterizing scenarios for safety testing of autonomous vehicles. In 2021 IEEE Intelligent Vehicles Symposium (IV), pages 157–164, 2021.
  8. Autonomy 2.0: Why is self-driving always 5 years away? CoRR, abs/2107.08142, 2021.
  9. Bayesian optimization with tree-structured dependencies. In D. Precup and Y. W. Teh, editors, Proceedings of the 34th International Conference on Machine Learning, volume 70 of Proceedings of Machine Learning Research, pages 1655–1664. PMLR, 06–11 Aug 2017.
  10. Desire: Distant future prediction in dynamic scenes with interacting agents. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 336–345, 2017.
  11. E. Leurent. An environment for autonomous driving decision-making, 2018.
  12. Hierarchical learning-based autonomy simulator.
  13. Interpretable and flexible target-conditioned neural planners for autonomous vehicles. In 2023 IEEE International Conference on Robotics and Automation (ICRA), pages 10076–10082. IEEE, 2023.
  14. Large-scale long-tailed recognition in an open world. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 2537–2546, 2019.
  15. Structured variationally auto-encoded optimization. In International Conference on Machine Learning, 2018.
  16. On exposing the challenging long tail in future prediction of traffic actors. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 13147–13157, October 2021.
  17. Scalable end-to-end autonomous vehicle testing via rare-event simulation. In Proceedings of the 32nd International Conference on Neural Information Processing Systems, NIPS’18, page 9849–9860, Red Hook, NY, USA, 2018. Curran Associates Inc.
  18. Bevsegformer: Bird’s eye view semantic segmentation from arbitrary camera rigs, 2022.
  19. Generating useful accident-prone driving scenarios via a learned traffic prior. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 17284–17294, 2021.
  20. Proximal policy optimization algorithms. CoRR, abs/1707.06347, 2017.
  21. Mixsim: A hierarchical framework for mixed reality traffic simulation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 9622–9631, June 2023.
  22. A. Wachi. Failure-scenario maker for rule-based agent using multi-agent adversarial reinforcement learning and its application to autonomous driving. In International Joint Conference on Artificial Intelligence, 2019.
  23. Advsim: Generating safety-critical scenarios for self-driving vehicles. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 9909–9918, June 2021.
Citations (3)

Summary

We haven't generated a summary for this paper yet.