Constrained Limit-Average Optimal Control
- Constrained limit-average problems are defined as optimizing long-term average cost under asymptotic constraints, critical for control and geometric optimization.
- The framework utilizes occupational measures and Carathéodory's theorem to reduce infinite-horizon problems to a finite mix of stationary and periodic solutions.
- Applications include Cheeger sets and PDE singular limits, demonstrating practical cyclic switching strategies for satisfying averaged constraints.
Constrained limit-average problems concern the minimization (or maximization) of a long-run average cost or reward under one or more averaged (asymptotic) constraints on system trajectories or policies. Such formulations appear across stochastic control, game theory, online learning, and geometric optimal control, providing a unifying lens to address performance under both objective and constraint satisfaction in the asymptotic regime.
1. Mathematical Formulation and Occupational Measure Framework
Consider a continuous-time control system on the plane with state , control , and dynamics
Let denote the cost density and () constraint densities. The weighted limit-average (or time-average) cost is
Each constraint is enforced on time averages: Assume compact; , , continuous and Lipschitz in ; and uniform controllability (from any to in in bounded time). The admissible controls keep the state in for all .
To handle the infinite-horizon averaging, the problem is reformulated in terms of limiting occupational measures . For a sequence , and given trajectories , the empirical measures
converge weakly to some limiting measure . The set is compact and convex. The average cost and constraints are expressed as and . Thus, the constrained limit-average problem becomes
This framework admits ratio objectives of the form , with everywhere, crucial for applications such as shape optimization (Bright, 2013).
2. Structure of Optimal Solutions: Poincaré–Bendixson Type Results
A foundational result is that, for planar systems under the above hypotheses, every extreme point of the set of achievable occupation measures is realized either by a stationary solution (, ) or a periodic solution whose trace is a Jordan curve.
For unconstrained problems, the minimum of the limit-average (or ratio) cost is realized by a stationary or single periodic solution. If constraints are introduced, Carathéodory's theorem ensures that the optimal occupation measure is a convex combination of at most extreme points, corresponding to at most stationary or periodic orbits. The control can therefore be implemented by cyclic switching among these orbits according to the convex coefficients, ensuring the averaged constraints are satisfied. When and the constraint is cumulative (holds for all finite ), the switching schedule can enforce this strong constraint at all times, not just asymptotically (Bright, 2013).
3. Principal Lemmas and Proof Constructs
The existence theory and structure results hinge on several key lemmas:
- Compactness of the occupational measure set : follows from compactness of and tightness criteria.
- Convexity under uniform controllability: if every to transition is possible, convex combinations of occupation measures associated to initial states/controls can also be realized.
- Carathéodory-type lemma: the intersection of with constraint hyperplanes has extreme points that are convex combinations of at most extreme points of .
- Quasi-concavity of the ratio functional : on the convex set .
Together, these imply that limit-average optimal control in the plane reduces, in both constrained and unconstrained cases, to analysis of a finite number of stationary or periodic arcs.
4. Applications: Cheeger Sets and PDE Singular Limits
The theory directly connects to two prominent geometric and variational problems:
Cheeger sets: The Cheeger ratio can be phrased as
where parameterizes as a positively oriented Jordan curve with fixed-speed parametrization. This is a limit-average problem with no constraints, and the existence theory guarantees extremal solutions are single periodic (i.e., the boundary traces an optimally shaped Cheeger set) (Bright, 2013).
Generalized Cheeger sets: When area and perimeter are weighted, the same structural conclusion holds.
Singular limits in PDEs: The stationary states of the 1D van der Waals/Cahn–Hilliard functional,
reduce, as , to a constrained limit-average problem for a 1D ODE with an averaged mass constraint. The minimizer cycles between stationary or periodic orbits, yielding a lamellar two-phase structure matching known singular patterns observed in this limit.
5. Significance, Limitations, and Further Directions
The occupational measure framework subsumes exact solution existence for both unconstrained and averaged-constrained planar limit-average optimal control, allowing a reduction from infinite-horizon behavior to a finite-dimensional, finitely-cyclic strategy (number of cycles at most for constraints). This framework bridges the gap between classical calculus of variations, optimal control, and geometric problems, yielding structural results (e.g., periodicity or cyclical switching for the optimal control law) that are not generally available in higher-dimensional or more complex systems.
Key limitations include the restriction to planar systems (for the periodic-or-stationary extreme point structure) and the need for compactness and controllability assumptions. Extensions to higher dimensions or non-convex control systems may not retain these properties. Further, while existence and structure results are comprehensive, deriving explicit switching schedules or trajectories in practice may require additional regularity or computational insights, especially as the number of constraints increases.
6. Summary Table: Core Theoretical Results
| Regime | Existence Result | Structure of Solution |
|---|---|---|
| No constraints | Minimum at periodic or stationary orbit | Single cycle or stationary trajectory |
| constraints | Minimum is convex comb. of | Cyclic switching among at most orbits |
| , cumulative | Strong constraint holds for all | Schedule ensures constraint always met |
This synthesis provides the existence theory, geometric structure, and practical realization of constrained limit-average control in the planar setting, serving as a foundation for further developments in infinite-horizon variational problems and geometric optimization (Bright, 2013).