Periodic Stochastic Game-Theoretic Riccati Equations
- Periodic SGTRDEs characterize the value function and optimal control policies in zero-sum LQ stochastic differential games with periodic system data.
- The dual-layer iterative method decomposes complex, sign-indefinite Riccati equations into tractable subproblems, ensuring convergence under stochastic stabilizability and detectability.
- Empirical evaluations demonstrate robust convergence with low iteration counts, validating its use in applications like financial engineering and cyclic control systems.
Periodic time-varying stochastic game-theoretic Riccati differential equations (SGTRDEs) constitute a class of matrix-valued nonlinear differential equations that arise in the optimal control and saddle-point analysis of zero-sum linear–quadratic stochastic differential games (LQ-SDGs) with periodic, time-dependent system data. These equations encode the value function and the optimal policies of two adversarial controllers interacting over a stochastic Itô system with both drift and diffusion coefficients being -periodic functions of time. The stabilizing periodic solutions to SGTRDEs determine global saddle-point optimality criteria and feedback synthesis for high-dimensional, time-varying stochastic systems.
1. Mathematical Formulation and Theoretical Foundations
Consider a filtered probability space supporting an -dimensional Brownian motion , and the state evolution governed by the controlled Itô dynamics: where (maximizer) and (minimizer) are control strategies. The corresponding cost functional of the zero-sum game is
where all data matrices are -periodic and continuous. Saddle-point (value) solutions are determined by a symmetric, matrix-valued function which satisfies a coupled, nonlinear Riccati-type matrix differential equation (the SGTRDE) with the periodic boundary condition . The sign-indefinite control weighting structure is encoded by
reflecting the maximizing () and minimizing () roles.
The full SGTRDE is
where and . The domain is restricted by the quadratic sign-definiteness: , .
Theoretical soundness relies on stochastic stabilizability and detectability notions: an Itô system is stochastically stabilizable if there exists a -periodic feedback making all closed-loop modes mean-square exponentially stable, and stochastically detectable under similar output criteria. Existence and uniqueness of periodic stabilizing solutions are contingent on these properties and the definiteness of .
2. Dual-Layer Iterative Solution Framework
Directly solving the periodic SGTRDE is complicated by its fully coupled, sign-indefinite, nonlinear structure. The introduced solution methodology reformulates the problem as a sequence of bilevel (dual-layer) interconnected subproblems expressed as interlaced iterates of matrix-valued functions: an "outer" sequence and an "inner" sequence , with .
- Initialization: , is the unique -periodic stabilizing solution of a definite-sign inner Riccati equation based on the open-loop case.
- Outer Update: .
- Inner Update: is obtained as the unique -periodic stabilizing solution of a Riccati equation with updated coefficients and an additional correction term .
The correction term and feedback mapping are given by
This iterative mechanism produces a monotone, non-decreasing sequence of outer approximants, with each inner solution computed for a definite-sign Riccati problem conditioned on the current guess.
3. Algorithmic Implementation and Workflow
The proposed algorithm to compute the stabilizing periodic solution is as follows:
- Initialization: Set .
- Compute Initial Inner Solution: Solve the inner Riccati DE with definite quadratic sign to obtain .
- Iterative Update: For :
- ,
- Compute ,
- Compute ,
- Solve inner Riccati (9) for new .
- Stopping Criterion: Terminate when for prescribed tolerance .
- Output: .
This approach ensures the accumulation of monotone corrections and, under appropriate stabilizability and detectability conditions, iteratively converges to the global stabilizing periodic solution.
Summary Table: Iterative Algorithm Structure
| Step | Description | Output |
|---|---|---|
| Initialization | Set | Initial outer approximation |
| Inner Solution | Solve definite-sign Riccati for | Correction term for |
| Outer Update | Updated solution candidate | |
| Stopping Rule | Stop if | Final stabilizing solution |
4. Analysis of Convergence and Theoretical Guarantees
The convergence analysis is grounded in domain invariance, monotonicity, and boundedness of correction sequences. Key statements:
- Domain Invariance: If , the iterates and auxiliary solutions remain within the domain of well-posedness for the generator .
- Auxiliary Existence: If the linear part is stochastically stabilizable/detectable and , each inner Riccati subproblem admits a unique -periodic stabilizing solution.
- Monotonicity: If the outer iterate yields a stable closed-loop, then the new inner solution dominates the previous iterate; thus, .
- Global Convergence: Under the above structural assumptions, the sequences are globally convergent:
- ,
- The limit is the unique -periodic stabilizing solution of the original SGTRDE.
The proof combines contraction arguments for auxiliary Riccati flows and monotone operator theory in the space of periodic symmetric matrix-valued functions.
5. Empirical Performance and Numerical Evaluation
Large-scale Monte Carlo experiments validate the iteration framework. For system orders , 1,000 random trials per (totaling 20,000) are conducted. Data is generated with:
- ,
- ,
- ,
- ,
- ,
- chosen to ensure ,
- Fixed period , with MATLAB default random seed.
Key observations:
- The required number of outer iterations for convergence (tolerance ) is typically $8$–$13$, exceeding $13$ in only $2$ out of $20,000$ trials.
- Inner iteration counts per outer step increase as the algorithm approaches stationarity.
- Low-dimensional systems exhibit higher variability in inner iteration counts, whereas higher-dimensional cases stabilize rapidly.
Qualitative histograms demonstrate consistent convergence behavior across dimensions and random instances, supporting the practical robustness of the dual-layer iteration scheme.
6. Applications and Broader Implications
Solutions to periodic SGTRDEs underpin feedback synthesis for zero-sum stochastic games with periodic coefficients, such as those encountered in financial engineering, signal processing, and systems with seasonal or cyclic behaviors. The presented algorithm provides a unified, numerically stable framework applicable to a broad class of such problems without restrictive simplifications or ad hoc regularization. The explicit dual-layer structure allows decomposition into tractable, definite-sign Riccati subproblems at each step, facilitating both theoretical analysis and scalable implementation.
A plausible implication is that the generality of this framework enables systematic studies and controller synthesis for new subclasses of time-periodic stochastic control problems, potentially extending to more complex multi-agent or non-zero-sum games, given further investigation of analogous structural properties.