Pure-Strategy Perfect Equilibrium
- Pure-strategy perfect equilibrium is a game theory concept defined by deterministic strategies that are optimal in every subgame and under slight perturbations.
- The equilibrium employs value recursion and dynamic programming to exclude non-credible mixed strategies, particularly in stochastic and extensive-form games.
- Practical applications include solving large extensive-form games and verifying trembling-hand perfection, although NP-hard issues arise in multi-player settings.
A pure-strategy perfect equilibrium is an equilibrium refinement in game theory characterized by profiles in which each player's strategy is deterministic (pure) and satisfies optimality within every conceivable subgame or in response to all vanishing perturbations of opponents' strategies. It excludes reliance on randomization except where warranted by the structure of the game, and enforces robust credibility of threats and choices throughout the entire game tree or strategic form.
1. Formal Definitions and Conceptual Frameworks
Three principal notions arise in the literature:
- Markov Perfect Equilibrium (MPE) in Continuous-Time Games In dynamic settings (e.g., war-of-attrition), a Markov pure strategy specifies closed exit regions , with exit occurring only upon hitting , and a hazard function vanishes identically, i.e., (Georgiadis et al., 2021). The equilibrium requirement is
for every alternative and state , with the expected discounted payoff given the Markov strategies.
- Trembling-Hand Perfect Equilibrium (THPE) in Strategic Form Games Given pure strategy profile in , is trembling-hand perfect if there exists a sequence of fully mixed strategy profiles converging to such that is a best reply to for all and (0812.0492). The defining property is:
- Subgame-Perfect Equilibrium (SPE) via Value Recursion in Extensive-Form Games A pure-strategy profile is subgame-perfect if it induces Nash equilibrium play in every subgame. In pentaform (extensive-form) representation, equilibrium is characterized by value recursion over subroots, formalized as
where each restriction is Nash on piece-form (Streufert, 2023).
2. Existence and Structure in Dynamic and Strategic Games
2.1 Dynamic War-of-Attrition (Stochastic Payoff Diffusion)
Pure-strategy MPEs emerge in stochastic war-of-attrition with heterogeneous exit payoffs (i.e., ) and irreducible diffusion , . Each firm's optimal response is characterized by threshold rules:
- Define single-player stopping thresholds as solutions to the optimal stopping problem (Lemma 1), yielding .
- Two MPE profiles:
- Profile A: Firm $2$ exits once ; Firm $1$ never exits.
- Profile B (when is small): Firm $1$ exits once ; Firm $2$ never exits.
- The equilibrium region is sharply delineated: for , only A or B can occur (Georgiadis et al., 2021).
2.2 Elimination of Mixed Strategies
Under stochastic payoffs () and heterogeneous exit values, all mixed-strategy Markov equilibria are excluded. Lemma 2 demonstrates that common support and the need for indifference cannot be achieved when (Theorem 1). If payoffs are deterministic () or exit payoffs are homogeneous (), classic mixed-strategy equilibria remain feasible.
3. Computation and Verification Complexity
3.1 Trembling-Hand Perfection: NP-Hardness
For strategic-form games with players and integer payoffs, deciding whether a given pure-strategy profile is trembling-hand perfect is shown to be NP-hard (0812.0492). The reduction originates from the three-player MINMAX problem and constructs an augmented game with an added absorbing strategy :
- If the minmax value for Player 1 in the base game is , then is trembling-hand perfect.
- If the minmax value is , trembling-hand perfection fails for . No polynomial algorithm can certify perfection unless P=NP. For two-player games, trembling-hand perfection coincides with dominance and is computable via LP.
3.2 Dynamic Programming Approach for Extensive Games
The value-recursion characterization for pure-strategy subgame-perfect equilibria allows recursive computation over subroots ("piece-forms"). Provided upper- and lower-convergence of the utility function , admissibility and persistence of value functions ensure authenticity—meaning that the continuation value at each subroot matches the realized utility along the induced outcome (Streufert, 2023).
4. Mathematical Characterizations and Criteria
4.1 Hamilton–Jacobi–Bellman and Variational Inequality
In continuous-time dynamic games, candidate value functions must solve the variational inequality: with boundary conditions ("smooth-pasting") at thresholds : These characterize optimal exit policies (and equilibrium regions).
4.2 One-Piece Unimprovability
Pure-strategy subgame-perfect equilibrium in pentaform games is equivalent to "one-piece unimprovability": for each subroot , player 's realized utility weakly dominates the utility attainable by any single-piece deviation at (Streufert, 2023).
5. Comparative Cases and Limitations
- When exit options are homogeneous () and/or payoffs are deterministic (), symmetric mixed-strategy equilibria can be constructed (hazard rates explicitly specified).
- The jump from to players in THPE induces a sharp complexity transition.
- For proper equilibrium (Myerson), NP-hardness also applies in three-player settings (0812.0492). The two-player computational status remains unresolved.
6. Interpretive Remarks and Theoretical Significance
- Pure-strategy perfect equilibrium refines Nash equilibrium by demanding optimality in every subgame and excluding non-credible threats.
- In dynamic programming treatments, equilibrium existence and computation proceed via Bellman-like recursion over subgame decomposition (subroots/piece-forms), unifying dynamic programming and equilibrium theory.
- Trembling-hand perfection and subgame perfection are robust against vanishing perturbations and payoff changes at future nodes, provided technical conditions (upper/lower convergence).
- The absence of mixed-strategy equilibria in perturbed dynamic games with heterogeneous payoffs highlights the fragility of mixed-strategy constructions to stochasticity and payoff asymmetry (Georgiadis et al., 2021).
7. Practical Applications and Computational Implications
- Large extensive-form games can be solved by recursive value-function methods over subroots, extending backwards-induction to general imperfect-information and infinite-horizon settings.
- For verifying trembling-hand perfection in three-player games, only exponential-time algorithms exist subject to current complexity bounds.
- Existence proofs and computational schemes directly inherit methods from dynamic programming and optimal control theory, relating Bellman equations to equilibrium recursion.
A plausible implication is that refinements such as pure-strategy perfect equilibrium may be preferable in environments featuring payoff stochasticity and heterogeneity, as they yield credibly stable and computationally tractable solutions in settings where mixed strategies become degenerate or untestable.