Partially Observable Discrete-time Discounted Markov Games with General Utility (2211.07888v1)

Published 15 Nov 2022 in math.OC

Abstract: In this paper, we investigate partially observable zero-sum games where the state process is a discrete-time Markov chain. We consider a general utility function in the optimization criterion. We show the existence of a value for both finite- and infinite-horizon games and also establish the existence of optimal policies. The main step involves converting the partially observable game into a completely observable game that also keeps track of the total discounted accumulated reward/cost.

Citations (1)
