2000 character limit reached
Pure Strategy Best Responses to Mixed Strategies in Repeated Games (1902.09066v1)
Published 25 Feb 2019 in cs.GT
Abstract: Repeated games are difficult to analyze, especially when agents play mixed strategies. We study one-memory strategies in iterated prisoner's dilemma, then generalize the result to k-memory strategies in repeated games. Our result shows that there always exists a pure strategy best response, which can be computed with SMT or MDP solvers. However, there may not exist such pure strategy best response in multi-agent tournaments. All source code is released for verification.