Mechanisms of aligning human value systems to current goals
Determine the cognitive mechanisms by which humans align their value system to current goals and identify the specific factors that make such alignment difficult to maintain during goal-dependent reinforcement learning and outcome evaluation.
References
However, the precise mechanisms by which humans align their value system to current goals -- and which aspects make such alignment hard to maintain -- remain uncharted.
Source: Molinaro et al., "Reward function compression facilitates goal-dependent reinforcement learning" (2509.06810, 8 Sep 2025), Introduction.