Reward function corresponding to human driving behavior

Determine a reward function that corresponds to human driving behavior for use in multi-agent reinforcement learning for autonomous driving, clarifying the target objective that accurately captures how people drive in interactive traffic scenarios.

Background

In the discussion of multi-agent reinforcement learning for driving, the paper notes that one approach to achieving human-compatible behavior is to encode human conventions via reward design.

However, the authors explicitly state that the correct reward function corresponding to human driving is not known, and they also caution that reward shaping can induce undesired behaviors.

This highlights a foundational open question about the specification of the optimization objective for learning agents that behave in a human-like manner in traffic.

References

However, it is not entirely clear what reward function corresponds to human driving and the inclusion of this type of reward shaping can create undesired behaviors~\citep{knox2023reward}.

— Human-compatible driving partners through data-regularized self-play reinforcement learning (2403.19648 - Cornelisse et al., 2024) in Section 4 Related work

Reward function corresponding to human driving behavior

Background

References

Related Problems