Linear-Quadratic Mean-Field Control
- Linear-Quadratic Mean-Field Control is a framework that couples individual and collective dynamics in large populations using two coupled Riccati equations.
- It employs forward-backward stochastic differential equations to derive explicit feedback laws separating fluctuations from mean-field components.
- The framework underpins scalable and stable control policies applicable to systems like energy grids and multi-agent robotics.
A linear-quadratic mean-field control (LQ-MFC) framework is a rigorous optimal control methodology for large-population or interacting stochastic dynamical systems, where both the state evolution and performance criteria depend linearly on the system state and control, and quadratically on the state, control, or their mean. The framework generalizes classical LQ control to collective or "mean-field" settings by coupling the dynamics and cost through the system's empirical mean, leading to a rich and tractable analysis in both stochastic and deterministic formulations. Characteristically, the optimal control is characterized by a system of forward-backward stochastic differential equations (FBSDEs) for the state and adjoint processes, which is decoupled through two coupled matrix Riccati equations and yields a feedback control separating fluctuation and mean-field components. This structure has broad implications for stability, scalability, and implementation in high-dimensional systems.
1. Problem Formulation and Mean-Field Coupling
The LQ-MFC problem is defined on a probability space with a Brownian motion . For :
- The controlled mean-field SDE is
where is the state, the control, and are deterministic matrix-valued coefficients.
- The cost functional is
with symmetric and, under well-posedness conditions, positive (semi-)definite (Yong, 2011).
Mean-field terms such as induce population-level interdependence, as the dynamics and costs depend not only on each agent’s state and control but also on their collective averages.
2. Pontryagin Principle, MF-FBSDEs, and the Optimality System
The necessary condition for optimality is derived via the stochastic maximum principle. Introducing adjoint processes , the Hamiltonian is
The corresponding adjoint mean-field BSDE reads
with terminal condition .
The pointwise optimality (stationarity) condition specifies:
The full optimality system thus comprises a coupled mean-field forward SDE (for the state) and backward SDE (for the adjoint), together with the stationarity condition (Yong, 2011).
3. Riccati Decoupling and Feedback Law Construction
Decoupling the mean-field FBSDE is achieved via an affine ansatz:
The decoupling yields two deterministic matrix Riccati ODEs:
- For fluctuations (“variance Riccati” ):
- For the mean-field component (“mean Riccati” ):
Under standard positive definiteness assumptions, these Riccati ODEs admit unique, symmetric solutions (Yong, 2011).
The resulting feedback representation for the optimal control is: This exhibits a decomposition into feedback gains for the deviation and for the mean, with the mean-field terms governing overall collective regulation and the fluctuation part ensuring stabilization of deviations.
4. Structural Features, Well-Posedness, and Special Cases
A defining characteristic of the LQ-MFC framework is the necessity of solving two coupled Riccati equations, instead of only one as in classical LQ problems. The variance Riccati governs the stabilization of stochastic deviations from the mean, while the mean Riccati regulates the evolution of the population mean.
- When all mean-field coefficients vanish (), the framework reduces to the standard LQ setting, with and a single Riccati equation (Yong, 2011).
- Existence and uniqueness of the solution rely on the uniform positive definiteness of , the non-negativity of , and , which ensure convexity and coercivity of the cost functional.
A table summarizing key elements:
| Component | Classical LQ | LQ-MFC |
|---|---|---|
| State equation | SDE, no mean-field | SDE with mean and control mean-field coupling |
| Cost functional | Quadratic in state/control | Quadratic in state/control/means |
| Riccati equations | One matrix ODE | Two coupled matrix ODEs (variance and mean) |
| Feedback law | Linear state feedback | Linear feedback on deviation and mean |
| Well-posedness | One positivity condition | Stronger, two-fold positivity in Riccati solutions |
5. Ergodic and Infinite-Horizon Structure
For infinite-horizon mean-field LQ control with constant coefficients, the finite-horizon Riccati ODEs converge exponentially fast to the unique stabilizing solutions of the corresponding algebraic Riccati equations (AREs). This allows for construction of steady-state, time-homogeneous feedback laws and explicit characterizations of long-term average costs (Bayraktar et al., 13 Feb 2025, Huang et al., 2012). The solution is structured by splitting the system into:
- A fluctuation subsystem, controlled by the “variance” Riccati;
- A mean subsystem, controlled by the “mean” Riccati.
In this regime, the turnpike property applies: optimal finite-horizon trajectories converge exponentially (except near the time boundaries) to the ergodic limit, which is the solution of the infinite-horizon feedback. This property justifies using stationary policies for long-run applications (Bayraktar et al., 13 Feb 2025).
6. Extensions and Applications
The LQ-MFC framework extends to several variants:
- Indefinite quadratic costs, handled via relaxed compensator techniques and further algebraic analysis (Li et al., 2020).
- Infinite-population and mean-field-team formulations, where decentralized controllers and social optima can be constructed and analyzed for scalability and robustness (Arabneydi et al., 2016, Wang et al., 2020).
- Systems with heterogeneous agents, cluster-based structures, and distributed information networks, leading to block-diagonal and local-global Riccati structure (Liang et al., 2022).
- Incorporation of risk constraints, leading to risk-adjusted Riccati equations and affine control laws insensitive to population size (Roudneshin et al., 2023).
Real-world applications arise in macroeconomic planning, networked control of multi-agent systems, distributed energy grids, and large-scale robotics, wherever systemic state or control averages are crucial performance determinants.
7. Theoretical and Computational Considerations
Solving the LQ-MFC problem involves integrating two coupled Riccati equations, with complexity independent of the agent population; thus the method is well-suited for large-scale systems. The resulting closed-loop laws are explicit and robust to both modeling details and information structure, enabling distributed implementation (Yong, 2011, Arabneydi et al., 2016). The theory also provides transparent sufficient conditions for existence, uniqueness, and stabilizability of the problem. Extensions to random coefficients, backward SDEs, and entropy-regularized reinforcement learning formulations further broaden the framework’s applicability (Xiong et al., 7 Jun 2024, Xiong et al., 1 Mar 2025, Frikha et al., 2023).
The linear-quadratic mean-field control framework thus forms a foundational methodology for scalable control of high-dimensional stochastic systems with nontrivial collective behavior, offering both technical depth and practical tractability.