Principled reward design for reinforcement learning in autonomous driving
Develop a principled methodology for designing reward functions for reinforcement learning-based autonomous driving that effectively guides learning in dynamic traffic while avoiding suboptimal rule-based heuristics and aligning with evaluation metrics.
References
Due to the flexibility in optimization, there are many ways to define the reward for driving, making reward design an important open problem .
— CaRL: Learning Scalable Planning Policies with Simple Rewards
(2504.17838 - Jaeger et al., 24 Apr 2025) in Appendix, Section 'Related work', subsubsection 'Rewards for Driving'