Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving (2306.03220v2)

Published 5 Jun 2023 in cs.RO and cs.AI

Abstract: Reinforcement learning (RL) is an effective approach to motion planning in autonomous driving, where an optimal driving policy can be automatically learned using the interaction data with the environment. Nevertheless, the reward function for an RL agent, which is significant to its performance, is challenging to be determined. The conventional work mainly focuses on rewarding safe driving states but does not incorporate the awareness of risky driving behaviors of the vehicles. In this paper, we investigate how to use risk-aware reward shaping to leverage the training and test performance of RL agents in autonomous driving. Based on the essential requirements that prescribe the safety specifications for general autonomous driving in practice, we propose additional reshaped reward terms that encourage exploration and penalize risky driving behaviors. A simulation study in OpenAI Gym indicates the advantage of risk-aware reward shaping for various RL agents. Also, we point out that proximal policy optimization (PPO) is likely to be the best RL method that works with risk-aware reward shaping.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Lin-Chi Wu (1 paper)
  2. Zengjie Zhang (23 papers)
  3. Sofie Haesaert (42 papers)
  4. Zhiqiang Ma (19 papers)
  5. Zhiyong Sun (73 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.