Settling the Reward Hypothesis (2212.10420v2)

Published 20 Dec 2022 in cs.AI, cs.LG, math.ST, and stat.TH

Abstract: The reward hypothesis posits that, "all of what we mean by goals and purposes can be well thought of as maximization of the expected value of the cumulative sum of a received scalar signal (reward)." We aim to fully settle this hypothesis. This will not conclude with a simple affirmation or refutation, but rather specify completely the implicit requirements on goals and purposes under which the hypothesis holds.

Citations (20)

View on Semantic Scholar