Reward function specification for real-world reinforcement learning
Determine how to specify reward functions in reinforcement learning that reliably induce desired behavior across the full range of operating conditions encountered in real-world physical deployments of embodied agents.
References
Specifying a reward function that produces the desired behavior across the full range of operating conditions encountered in physical deployment is notoriously difficult and remains an open problem.
— Active Inference for Physical AI Agents -- An Engineering Perspective
(2603.20927 - Vries, 21 Mar 2026) in Section 8.3, Active Inference vs. Reinforcement Learning