2000 character limit reached
Variance Adjusted Actor Critic Algorithms
Published 14 Oct 2013 in stat.ML, cs.LG, and cs.SY | (1310.3697v1)
Abstract: We present an actor-critic framework for MDPs where the objective is the variance-adjusted expected return. Our critic uses linear function approximation, and we extend the concept of compatible features to the variance-adjusted setting. We present an episodic actor-critic algorithm and show that it converges almost surely to a locally optimal point of the objective function.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.