Distributed Linear Quadratic Regulator
- Distributed LQR is a control strategy that decentralizes the centralized LQR approach by leveraging local and neighboring state information for optimal feedback.
- The κ-distributed architecture applies a tunable locality constraint, yielding an exponential decay in the optimality gap as the communication radius increases.
- Practical implementations balance communication overhead and performance by selecting a truncation radius that scales logarithmically with the desired suboptimality.
A distributed Linear Quadratic Regulator (LQR) is a control strategy for large-scale, networked systems in which the computation and execution of optimal control laws are performed in a decentralized or distributed manner across multiple agents, each with access only to limited local or neighboring information. Distributed LQR theory addresses the synthesis, analysis, and performance limits of such controllers, particularly in the context of communication-constrained, sparsely coupled, or privacy-sensitive agent networks.
1. Problem Formulation and Standard Centralized LQR
Consider a networked, discrete-time linear system with agents, described by a global state and input evolving as
where and have a block structure consistent with the edge set of an undirected graph . Each node has access to a local state and applies control input . The canonical infinite-horizon quadratic cost is
0
where 1, 2 encode performance and effort penalties. Under stabilizability and detectability, the centralized LQR synthesizes a unique optimal state-feedback 3, with 4 derived from the discrete algebraic Riccati equation: 5
6
This solution assumes global state measurements and all-to-all communication, which is impractical for large-scale, distributed systems.
2. κ-Distributed LQR Architecture and Exponential Performance Decay
The 7-distributed LQR architecture introduces a tunable locality constraint by truncating the global gain 8 to a radius-9 neighborhood on the graph: 0 where 1, with 2 the shortest-path distance on 3. Equivalently, 4 if 5, zero otherwise.
The optimality gap of the 6-distributed LQR relative to the centralized solution is quantified by
7
where constants 8 and 9 depend only on control-theoretic and graph growth parameters. Specifically, 0 decays exponentially in the locality radius 1, provided the number of nodes at distance 2 from any node grows subexponentially. This ensures near-centralized performance can be attained with modest communication radii, a key result for scalable distributed control (Shin et al., 2022).
3. Assumptions and Performance Gap Analysis
The analytic guarantees for distributed LQR structure are predicated on:
- Uniform boundedness and well-conditioning of system and cost matrices;
- Stabilizability: existence of 3 such that 4 for 5;
- Detectability (dual conditions on 6);
- Subexponential growth condition for the communication/interaction graph: the number of nodes at distance 7 increases no faster than a subexponential function 8.
Under these, the constants controlling the exponential decay—specifically the decay rate 9—are determined by the worst-case stability margins and the graph growth function. As stabilizability or detectability degrades (0 or 1), convergence to centralized performance is slower. For typical low-dimensional meshes (e.g., grids), the exponential decay in 2 dominates any polynomial growth of 3.
4. Algorithmic Implementation and Communication Structure
The distributed LQR controller with truncation radius 4 is implemented as follows:
- Centralized DARE solution: Compute 5 and 6 offline (system-wide or via distributed policy-iteration).
- Local exchange: Each agent transmits its local state 7 to all agents within 8 hops (performed via 9 synchronous communication rounds, e.g., gossip protocols).
- Local control: Agent 0 collects neighbor states within 1 and applies 2.
- Synchronization burden: Each time step requires 3 communication rounds and matrix-vector multiplication involving only the local 4-neighborhood.
To achieve any prescribed suboptimality 5, it suffices to choose
6
demonstrating logarithmic scaling of communication depth with desired accuracy.
5. Trade-Offs, Locality, and Practical Implications
The 7-distributed LQR architecture interpolates between fully decentralized (8) and fully centralized (9 graph diameter) controllers. The design enables explicit trade-off characterization:
- Communication vs. performance: Increasing 0 reduces the optimality gap exponentially at a cost of increased communication and computational burden.
- Guidelines: Select acceptable suboptimality 1, then compute the minimal 2.
- Graph-theoretic constraints: For networks with subexponential graph growth and moderate degrees, the necessary constants for performance guarantees remain practical even in large-scale systems.
- Empirical insight: In high-dimensional or sparse networks, exchanging states out to a dozen hops recovers 3 of the centralized performance.
6. Extensions and Connections to Broader Distributed Control Literature
The analytic paradigm established by the 4-distributed LQR framework is a foundation for more general distributed and localized LQR synthesis—encompassing, for instance:
- Distributed Riccati recursions and consensus-based policy iteration;
- Block-decreasing structure of 5 as established by block-decay theorems;
- Outward links to system-level synthesis (SLS) approaches that parameterize constrained distributed LQR via affine and subspace constraints, enabling convex formulations of infinite-horizon, locality-constrained optimal control (Kjellqvist et al., 2022);
- Robustness extensions to dropouts, privacy constraints, information attacks, and dynamic topology modifications.
Such frameworks also motivate the development of algorithms for distributed model-free learning of LQR (e.g., using local policy-gradient, Q-learning, or RL approaches), with guarantees on stability, regret, or learning efficiency under distributed information constraints.
7. Limitations and Future Directions
While near-optimal performance is attainable with 6-locality under mild and verifiable assumptions, several challenges remain:
- Computing or approximating the global Riccati solution or 7 in a scalable, fully distributed, and communication-efficient manner for very large networks;
- Handling time-varying, directed, or weighted graphs and nonlinearly coupled agent dynamics;
- Integrating output-feedback, partial observability, or adversarial uncertainty.
- Extending to finite-horizon, receding-horizon, and event-triggered distributed LQR variants, as well as ensuring resilience to faults and communication errors.
The 8-distributed LQR framework is a central analytic milestone, establishing that moderate locality suffices for high-performance distributed stabilization and regulation in complex networked systems (Shin et al., 2022).