Papers
Topics
Authors
Recent
Search
2000 character limit reached

Sinkhorn MPC: Real-Time Transport Control

Updated 30 May 2026
  • Sinkhorn MPC is a real-time control strategy that combines receding-horizon MPC with entropic optimal transport to steer a finite population of linear agents from an initial to a target distribution.
  • It employs iterative Sinkhorn updates to efficiently solve large-scale assignment problems by relaxing hard permutation constraints into smooth, tractable couplings.
  • The framework ensures convergence and stability by utilizing warm-starting, tailored horizon lengths, and careful tuning of regularization and Sinkhorn iteration counts.

Sinkhorn Model Predictive Control (Sinkhorn MPC) is a real-time dynamical transport algorithm for steering a finite population of agents, each governed by linear dynamics, from an initial empirical distribution to a desired target distribution. It synthesizes model predictive control (MPC) for receding-horizon trajectory optimization with entropic-regularized optimal transport, specifically using the Sinkhorn algorithm for tractable real-time assignment. The central innovation is the simultaneous integration of transport planning (target assignment) and control, enabling cost-efficient, scalable population steering under system and input constraints (Ito et al., 2023, Ito et al., 2022).

1. Mathematical Formulation and Problem Structure

Consider NN agents, each with continuous or discrete linear time-invariant dynamics: x˙i(t)=Aixi(t)+Biui(t),xi(0)=xi0,\dot x_i(t) = A_i x_i(t) + B_i u_i(t),\quad x_i(0) = x_i^0, with state xi(t)Xix_i(t)\in\mathcal{X}_i and control ui(t)Uiu_i(t)\in\mathcal{U}_i. The control objective is to assign each agent to a distinct target {xj}j=1N\{x_j^*\}_{j=1}^N (permutation σ\sigma), minimizing the infinite-horizon sum of per-agent costs: minσi=1Nci(xi0,xσ(i)),ci(xi0,xj)=minui0i(xi(t),ui(t);xj)dt.\min_\sigma\, \sum_{i=1}^N c_\infty^i(x_i^0, x_{\sigma(i)}^*),\qquad c_\infty^i(x_i^0, x_j^*) = \min_{u_i} \int_0^\infty \ell_i(x_i(t), u_i(t); x_j^*)\,dt. Here i()\ell_i(\cdot) is a nonnegative, equilibrium-detecting running cost. This direct problem is computationally intractable due to the factorial-size assignment space and coupled infinite-horizon optimal control subproblems [(Ito et al., 2023), Eqns. (3)-(8)].

Sinkhorn MPC replaces the infinite-horizon problem with a finite-horizon receding surrogate. At each MPC time step, agents solve for cost cTpi(xˇi,xj)c_{T_p}^i(\check x_i, x_j^*) (over horizon TpT_p with terminal constraint x˙i(t)=Aixi(t)+Biui(t),xi(0)=xi0,\dot x_i(t) = A_i x_i(t) + B_i u_i(t),\quad x_i(0) = x_i^0,0) and only the first input x˙i(t)=Aixi(t)+Biui(t),xi(0)=xi0,\dot x_i(t) = A_i x_i(t) + B_i u_i(t),\quad x_i(0) = x_i^0,1 is applied. Assignment is relaxed to a soft coupling x˙i(t)=Aixi(t)+Biui(t),xi(0)=xi0,\dot x_i(t) = A_i x_i(t) + B_i u_i(t),\quad x_i(0) = x_i^0,2, minimizing a linear combination of assignment costs subject to marginal constraints: x˙i(t)=Aixi(t)+Biui(t),xi(0)=xi0,\dot x_i(t) = A_i x_i(t) + B_i u_i(t),\quad x_i(0) = x_i^0,3 with x˙i(t)=Aixi(t)+Biui(t),xi(0)=xi0,\dot x_i(t) = A_i x_i(t) + B_i u_i(t),\quad x_i(0) = x_i^0,4 and x˙i(t)=Aixi(t)+Biui(t),xi(0)=xi0,\dot x_i(t) = A_i x_i(t) + B_i u_i(t),\quad x_i(0) = x_i^0,5 the uniform doubly-stochastic polytope. The entropic regularization parameter x˙i(t)=Aixi(t)+Biui(t),xi(0)=xi0,\dot x_i(t) = A_i x_i(t) + B_i u_i(t),\quad x_i(0) = x_i^0,6 controls the smoothness of x˙i(t)=Aixi(t)+Biui(t),xi(0)=xi0,\dot x_i(t) = A_i x_i(t) + B_i u_i(t),\quad x_i(0) = x_i^0,7 and interpolates between soft and hard permutations [(Ito et al., 2023), Eq. (22)].

2. Sinkhorn Iterations and Numerical Realization

The entropic-regularized optimal transport problem is efficiently solved via Sinkhorn's matrix-scaling algorithm. The key steps involve computing the Gibbs kernel: x˙i(t)=Aixi(t)+Biui(t),xi(0)=xi0,\dot x_i(t) = A_i x_i(t) + B_i u_i(t),\quad x_i(0) = x_i^0,8 and then iteratively updating nonnegative scaling vectors x˙i(t)=Aixi(t)+Biui(t),xi(0)=xi0,\dot x_i(t) = A_i x_i(t) + B_i u_i(t),\quad x_i(0) = x_i^0,9 to satisfy row/column marginals: xi(t)Xix_i(t)\in\mathcal{X}_i0 Here “xi(t)Xix_i(t)\in\mathcal{X}_i1” denotes element-wise division. The soft coupling is xi(t)Xix_i(t)\in\mathcal{X}_i2. In practical MPC integration, only a fixed (often small) number of Sinkhorn updates xi(t)Xix_i(t)\in\mathcal{X}_i3 are executed per time-step, making the approach compatible with real-time constraints and large agent populations [(Ito et al., 2022), (9); (Ito et al., 2023), Eqns. (SMPC_sink) and Algorithm 1].

3. Receding-Horizon Sinkhorn MPC Closed-Loop Dynamics

At each discrete time xi(t)Xix_i(t)\in\mathcal{X}_i4:

  1. States xi(t)Xix_i(t)\in\mathcal{X}_i5 observed; finite-horizon costs xi(t)Xix_i(t)\in\mathcal{X}_i6 formed by solving xi(t)Xix_i(t)\in\mathcal{X}_i7 QPs (parallelizable).
  2. Gibbs kernel xi(t)Xix_i(t)\in\mathcal{X}_i8 computed.
  3. xi(t)Xix_i(t)\in\mathcal{X}_i9 Sinkhorn iterations update scaling vectors; coupling ui(t)Uiu_i(t)\in\mathcal{U}_i0 approximated as above.
  4. Each agent forms a barycentric target via

ui(t)Uiu_i(t)\in\mathcal{U}_i1

  1. Individual MPCs steer each agent towards ui(t)Uiu_i(t)\in\mathcal{U}_i2; plant updated to ui(t)Uiu_i(t)\in\mathcal{U}_i3.

A distinguishing implementation feature is warm-starting the ui(t)Uiu_i(t)\in\mathcal{U}_i4 scaling vector, significantly accelerating Sinkhorn convergence after the first iteration [(Ito et al., 2023), Algorithm 1; Fig. 5].

4. Theoretical Analysis: Convergence, Stability, and Boundedness

4.1. Existence and Nature of Equilibria

Under continuity and differentiability assumptions on the cost and navigator functions, a fixed point ui(t)Uiu_i(t)\in\mathcal{U}_i5 exists such that ui(t)Uiu_i(t)\in\mathcal{U}_i6, where ui(t)Uiu_i(t)\in\mathcal{U}_i7 is the equilibrium coupling [(Ito et al., 2023), Proposition 1].

4.2. Global Convergence under Exact Coupling

If the exact optimal coupling is computed at each step (i.e., enough Sinkhorn iterations to convergence), continuous-time Sinkhorn MPC converges to the equilibrium set under strict positivity of the running cost and technical regularity conditions. The proof uses the entropic-OT cost as a Lyapunov function and applies LaSalle’s principle [(Ito et al., 2023), Theorem 1].

4.3. Ultimate Boundedness and Discrete Realizability

In the discrete-time setting, even with a single Sinkhorn update per step, the closed-loop states are ultimately bounded. Specifically, for any ui(t)Uiu_i(t)\in\mathcal{U}_i8 and for all ui(t)Uiu_i(t)\in\mathcal{U}_i9,

{xj}j=1N\{x_j^*\}_{j=1}^N0

where {xj}j=1N\{x_j^*\}_{j=1}^N1 is the closed-loop MPC matrix, with spectral radius {xj}j=1N\{x_j^*\}_{j=1}^N2, and {xj}j=1N\{x_j^*\}_{j=1}^N3 bounds the navigator range [(Ito et al., 2023), Prop. 3; (Ito et al., 2022), Prop. 4.1].

4.4. Local Asymptotic Stability

For quadratic running cost and unconstrained/weakly-constrained systems, all isolated equilibria of the discrete Sinkhorn MPC scheme are locally asymptotically stable, both in the "over-smoothed" ({xj}j=1N\{x_j^*\}_{j=1}^N4 large) and "hard-assignment" ({xj}j=1N\{x_j^*\}_{j=1}^N5) regimes [(Ito et al., 2023), Theorem 2; (Ito et al., 2022), 4.2]. The construction of a composite Lyapunov function combining state error and Hilbert projective metric on the Sinkhorn scaling vectors supports this claim.

5. Computational Complexity, Algorithmic Steps, and Scalability

The primary computational costs per step are:

  • Formulation of the {xj}j=1N\{x_j^*\}_{j=1}^N6 cost matrix {xj}j=1N\{x_j^*\}_{j=1}^N7 (parallelizable over agents): {xj}j=1N\{x_j^*\}_{j=1}^N8.
  • Sinkhorn iterations: two matrix-vector products per iteration, {xj}j=1N\{x_j^*\}_{j=1}^N9.
  • σ\sigma0 independent MPC optimizations (closed-form in linear-quadratic settings, σ\sigma1 otherwise).

A single Sinkhorn iteration for σ\sigma2 costs σ\sigma30.006 ms, compared to σ\sigma40.1 s for the Hungarian method. For σ\sigma5: Sinkhorn at 0.006 ms, 0.03 ms, 0.08 ms; full LP at 0.12 s, 6.4 s, 66 s, respectively, illustrating the scalability of Sinkhorn MPC. Warm-starting further decreases per-step Sinkhorn iteration count (from σ\sigma6520 to σ\sigma7100 after first step for σ\sigma8) [(Ito et al., 2023), Table 1, Figs. 3–5; (Ito et al., 2022), section 6].

6. Influence of Parameters and Practical Guidelines

  • Horizon σ\sigma9: Should exceed the mixing time required for individual agents' MPC trajectories to nearly reach their (temporary) targets.
  • Entropic regularization minσi=1Nci(xi0,xσ(i)),ci(xi0,xj)=minui0i(xi(t),ui(t);xj)dt.\min_\sigma\, \sum_{i=1}^N c_\infty^i(x_i^0, x_{\sigma(i)}^*),\qquad c_\infty^i(x_i^0, x_j^*) = \min_{u_i} \int_0^\infty \ell_i(x_i(t), u_i(t); x_j^*)\,dt.0: Large values favor smooth, distributed assignments and improve transient response, but increase steady-state bias. Small minσi=1Nci(xi0,xσ(i)),ci(xi0,xj)=minui0i(xi(t),ui(t);xj)dt.\min_\sigma\, \sum_{i=1}^N c_\infty^i(x_i^0, x_{\sigma(i)}^*),\qquad c_\infty^i(x_i^0, x_j^*) = \min_{u_i} \int_0^\infty \ell_i(x_i(t), u_i(t); x_j^*)\,dt.1 sharpens the assignment towards combinatorial (permutation) solutions, at the cost of increased numerical iterations and possible transient artifacts.
  • Sinkhorn step count minσi=1Nci(xi0,xσ(i)),ci(xi0,xj)=minui0i(xi(t),ui(t);xj)dt.\min_\sigma\, \sum_{i=1}^N c_\infty^i(x_i^0, x_{\sigma(i)}^*),\qquad c_\infty^i(x_i^0, x_j^*) = \min_{u_i} \int_0^\infty \ell_i(x_i(t), u_i(t); x_j^*)\,dt.2: One iteration yields higher energy and lower quality transients. minσi=1Nci(xi0,xσ(i)),ci(xi0,xj)=minui0i(xi(t),ui(t);xj)dt.\min_\sigma\, \sum_{i=1}^N c_\infty^i(x_i^0, x_{\sigma(i)}^*),\qquad c_\infty^i(x_i^0, x_j^*) = \min_{u_i} \int_0^\infty \ell_i(x_i(t), u_i(t); x_j^*)\,dt.3 or more closely approximates unregularized assignment solutions.
  • Cost shaping: Quadratic minσi=1Nci(xi0,xσ(i)),ci(xi0,xj)=minui0i(xi(t),ui(t);xj)dt.\min_\sigma\, \sum_{i=1}^N c_\infty^i(x_i^0, x_{\sigma(i)}^*),\qquad c_\infty^i(x_i^0, x_j^*) = \min_{u_i} \int_0^\infty \ell_i(x_i(t), u_i(t); x_j^*)\,dt.4 enables closed-form MPC updates; non-quadratic and input-constrained cases remain tractable but require additional QP or LP solution per agent [(Ito et al., 2023), Section 5; (Ito et al., 2022), Section 6].

7. Numerical Examples and Benchmark Results

  • Continuous linear agents (quadratic cost): For minσi=1Nci(xi0,xσ(i)),ci(xi0,xj)=minui0i(xi(t),ui(t);xj)dt.\min_\sigma\, \sum_{i=1}^N c_\infty^i(x_i^0, x_{\sigma(i)}^*),\qquad c_\infty^i(x_i^0, x_j^*) = \min_{u_i} \int_0^\infty \ell_i(x_i(t), u_i(t); x_j^*)\,dt.5, minσi=1Nci(xi0,xσ(i)),ci(xi0,xj)=minui0i(xi(t),ui(t);xj)dt.\min_\sigma\, \sum_{i=1}^N c_\infty^i(x_i^0, x_{\sigma(i)}^*),\qquad c_\infty^i(x_i^0, x_j^*) = \min_{u_i} \int_0^\infty \ell_i(x_i(t), u_i(t); x_j^*)\,dt.6, minσi=1Nci(xi0,xσ(i)),ci(xi0,xj)=minui0i(xi(t),ui(t);xj)dt.\min_\sigma\, \sum_{i=1}^N c_\infty^i(x_i^0, x_{\sigma(i)}^*),\qquad c_\infty^i(x_i^0, x_j^*) = \min_{u_i} \int_0^\infty \ell_i(x_i(t), u_i(t); x_j^*)\,dt.7, the method achieves rapid convergence (smooth trajectories), with per-step Sinkhorn costs as noted above.
  • Effect of minσi=1Nci(xi0,xσ(i)),ci(xi0,xj)=minui0i(xi(t),ui(t);xj)dt.\min_\sigma\, \sum_{i=1}^N c_\infty^i(x_i^0, x_{\sigma(i)}^*),\qquad c_\infty^i(x_i^0, x_j^*) = \min_{u_i} \int_0^\infty \ell_i(x_i(t), u_i(t); x_j^*)\,dt.8 and minσi=1Nci(xi0,xσ(i)),ci(xi0,xj)=minui0i(xi(t),ui(t);xj)dt.\min_\sigma\, \sum_{i=1}^N c_\infty^i(x_i^0, x_{\sigma(i)}^*),\qquad c_\infty^i(x_i^0, x_j^*) = \min_{u_i} \int_0^\infty \ell_i(x_i(t), u_i(t); x_j^*)\,dt.9: i()\ell_i(\cdot)0 increases transient energy; i()\ell_i(\cdot)1 nearly matches ideal assignment performance; lower i()\ell_i(\cdot)2 sharpens the steady-state but slows convergence.
  • Non-quadratic costs & input constraints: For i()\ell_i(\cdot)3 subject to i()\ell_i(\cdot)4, Sinkhorn MPC executes successfully with i()\ell_i(\cdot)5, i()\ell_i(\cdot)6 for i()\ell_i(\cdot)7, exhibiting sparse control and accurate terminal distribution [(Ito et al., 2023), Figs. 6–8].

References:

  • "Entropic Model Predictive Optimal Transport over Dynamical Systems" (Ito et al., 2023).
  • "Sinkhorn MPC: Model predictive optimal transport over dynamical systems" (Ito et al., 2022).
Definition Search Book Streamline Icon: https://streamlinehq.com
References (2)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Sinkhorn MPC.