Mean-Field Stochastic LQR Controller
- Mean-Field Stochastic LQR Controller is a control strategy that extends classical LQR by incorporating both individual-state and population-averaged interactions under stochastic uncertainty.
- It employs a Lagrangian dual formulation to decouple mean and fluctuation dynamics, solving coupled Riccati equations for robust, scalable state-feedback synthesis.
- Applications in microgrid frequency control and large-scale power networks demonstrate its ability to reduce overshoots and manage risk through variance-based constraints.
A mean-field stochastic linear quadratic regulator (MF-SLQR) controller generalizes the classical LQR approach to systems with both individual-state and mean-field (population-averaged) interactions under stochastic uncertainty, and further constrains state fluctuations to address risk. Such controllers are crucial in high-dimensional multi-agent networks, power grids, and large-scale coupled stochastic systems, particularly when low-probability, high-impact events must be systematically attenuated rather than averaged away as in the risk-neutral setting.
1. Problem Formulation: Dynamics, Cost, and Risk Constraint
Consider exchangeable agents indexed by with individual state and control . Each agent's dynamics incorporate both local and mean-field coupling: where , , and is an i.i.d. zero-mean noise sequence. In the mean-field limit, agent states decompose into fluctuation and mean components: where , .
The infinite-horizon average quadratic cost per agent (risk-neutral) is
To control rare but impactful fluctuations, a variance-type risk constraint is imposed. Define
where denotes agent 's history. The per-player time-averaged variance is
The MF-SLQR controller seeks to
with risk budget (Roudneshin et al., 2023).
2. Lagrangian Dual Formulation and Decomposition
A Lagrange multiplier combines the nominal and risk costs: Owing to orthogonal mean–fluctuation decomposition, decouples into two independent infinite-horizon LQR objectives: where
and . The dependence on drives risk sensitivity (Roudneshin et al., 2023).
3. Coupled Riccati Equations and Controller Synthesis
Both mean and fluctuation subsystems admit closed-form LQR solutions. The optimal value of is enforced via a primal–dual algorithm.
Riccati Equations
These equations admit a unique positive semidefinite stabilizing solution under standard MF-LQR stabilizability/detectability criteria (Roudneshin et al., 2023).
Controller Law
The optimal state-feedback is affine in the agent's own state and in the population mean: with
and arises if the noise is nonzero-mean in the dual formulation but is zero otherwise (Roudneshin et al., 2023). The feedback structure matches that of classical mean-field LQR, but the gain matrices internalize the variance penalty via the modified cost matrices.
4. Risk Constraint Enforcement and Computational Aspects
The optimal Lagrange multiplier is computed via a primal–dual loop. At each iteration:
- Update , ;
- Solve the two Riccati equations for , ;
- Form , , ;
- Simulate closed-loop dynamics and evaluate ;
- Update by a projected subgradient step: Each iteration requires operations (Riccati/Lyapunov equations) and is independent of the agent population . The resulting gains do not depend on , which ensures scalability (Roudneshin et al., 2023).
5. Structural Properties and Solution Comparison
- Independence from Number of Players: All Riccati equations and controller gains depend only on local and mean-field coupling () and cost weights, not on . This makes the approach viable for large-scale networks (Roudneshin et al., 2023).
- Existence and Uniqueness: Unique stabilizing Riccati solutions and dual multipliers are guaranteed by standard LQR system-theoretic conditions and strong duality (Roudneshin et al., 2023).
- Risk Parameter Influence: As is decreased (tighter variance constraint), increases, which modifies , leading to more conservative (higher-magnitude feedback gains). This reduces the amplitude of risky excursions (e.g., overshoots) at the cost of modestly increased average cost (Roudneshin et al., 2023).
- Comparison to Risk-Neutral MF-LQR: Setting recovers the standard mean-field LQR controller, which ignores state variance (Roudneshin et al., 2023).
6. Practical Example and Performance Analysis
A microgrid frequency control scenario is used as a high-dimensional case paper. Each area (agent) maintains a $4$-dimensional state including local frequency, generation, tie-line flow, and area control error integral. System matrices are drawn from standard load frequency control (LFC) parameters and quadratic costs are assigned.
Table: Impact of Variance Constraint on Controller Behavior
| (risk tolerance) | (dual) | Gain Magnitude | Overshoot | Average Cost |
|--------------------------|--------------------|---------------|-----------|------------------|
| High | | Baseline | Large | Baseline |
| Moderate | Moderate | Increased | Reduced | Slightly higher |
| Low | Large | Largest | Minimal | Highest |
Reducing leads to increased and higher feedback gain magnitude, yielding faster disturbance damping and less vulnerability to rare high-variance events (Roudneshin et al., 2023).
7. Generalizations and Methodological Significance
Risk-constrained MF-SLQR controllers extend classic LQR and mean-field LQR by systematically regulating not only expected cost but also rare-event risk, via variance-type constraints. The affine law and dual-based synthesis parallel but augment the classical mean–fluctuation separation and are compatible with standard Riccati-based implementation. Scalability and algorithmic simplicity are preserved, and the general methodology readily extends to other risk proxies and limit regimes (Roudneshin et al., 2023).
These contributions are directly relevant for modern applications requiring explicit robustness to stochastic volatility and population-level coupling, such as large-scale energy systems and networked autonomous agents.