Papers
Topics
Authors
Recent
2000 character limit reached

ND-iBR: Decentralized Game Dynamics

Updated 22 November 2025
  • ND-iBR is a decentralized game-theoretic method where agents update strategies using local information, enabling efficient approximation of Nash equilibria.
  • It employs an iterative best-response mechanism localized to dynamic neighborhoods to drastically reduce computational complexity and enhance scalability.
  • Empirical evaluations in multi-agent motion planning and networked evolutionary games demonstrate ND-iBR’s robust performance, fast convergence, and improved safety.

Neighborhood-Dominated iterative Best Response (ND-iBR) denotes a class of decentralized game-theoretic dynamics in which agents update their strategies based on local information from neighboring agents, leading to scalable, efficient computation of (approximate) Nash equilibria in multi-agent settings. ND-iBR extends classical iterative best-response schemes by explicitly exploiting problem structure that limits agent interactions to local neighborhoods, yielding both theoretical convergence guarantees and practical computational gains, particularly in dynamic potential games and networked evolutionary games (Mai et al., 15 Nov 2025, Chellig et al., 2020).

1. Game-Theoretic Foundations

ND-iBR operates in environments composed of agents N={1,,N}\mathcal{N} = \{1, \ldots, N\}, each with its own state xiXix_i \in \mathcal{X}_i and control or strategy uiUiu_i \in \mathcal{U}_i, evolving via potentially nonlinear, discrete-time dynamics xi(t+1)=fi(xi(t),ui(t))x_i^{(t+1)} = f_i(x_i^{(t)}, u_i^{(t)}) (Mai et al., 15 Nov 2025). Agents optimize finite-horizon cost functionals of the form

Ji(x(0),(ui,ui))=t=0T1Li(x(t),ui(t))+LiF(x(T)),J_i(x^{(0)}, (u_i, u_{-i})) = \sum_{t=0}^{T-1} L_i(x^{(t)}, u_i^{(t)}) + L_i^F(x^{(T)}),

where the instantaneous and terminal terms can be decomposed into “global” potential terms, coupling all agents, and agent-independent or “remainder” components: \begin{align*} L_i(x{(t)}, u_i{(t)}) &= P(x{(t)}, u{(t)}) + \Theta_i(x_{-i}{(t)}, u_{-i}{(t)}) \ L_iF(x{(T)}) &= R(x{(T)}) + \Xi_i(x_{-i}{(T)}). \end{align*} Such structure admits formulation as a dynamic potential game, in which simultaneous minimization of the global potential Φ(u)=t=0T1P(x(t),u(t))+R(x(T))\Phi(u) = \sum_{t=0}^{T-1} P(x^{(t)}, u^{(t)}) + R(x^{(T)}) aligns with finding Nash equilibria (Mai et al., 15 Nov 2025).

2. The iε-BR Process and the Emergence of ND-iBR

The classical iterative ε\varepsilon-best response (iε\varepsilon-BR) provides a constructive path to approximate Nash equilibria. At each iteration, an agent ii with possible strategy improvement Ji(x(0),(uik,uik))infviJi(x(0),(vi,uik))εJ_i(x^{(0)}, (u_i^k, u_{-i}^k)) - \inf_{v_i} J_i(x^{(0)}, (v_i, u_{-i}^k)) \geq \varepsilon solves the local optimization subproblem

uik+1argminuiJi(x(0),(ui,uik)),u_i^{k+1} \in \arg\min_{u_i} J_i(x^{(0)}, (u_i, u_{-i}^k)),

with other agents’ strategies unchanged (Mai et al., 15 Nov 2025). Each such update strictly decreases the global potential by at least ε\varepsilon, ensuring finite-step convergence in compact, continuous games.

ND-iBR modifies this process by localizing the coupling in each agent’s cost to a dynamic neighborhood Ni(t)={ji:xi(t)xj(t)<dProx}\mathcal{N}_i^{(t)} = \{j \neq i : \|x_i^{(t)} - x_j^{(t)}\| < d^{\text{Prox}}\}, yielding a local cost functional

J~i(x(0),(ui,ui))=t=0T1L~i(x(t),ui(t),{uj(t)}jNi(t))+L~iF(x(T),{xj(T)}jNi(T))\tilde{J}_i(x^{(0)}, (u_i, u_{-i})) = \sum_{t=0}^{T-1} \tilde{L}_i\left(x^{(t)}, u_i^{(t)}, \{u_j^{(t)}\}_{j\in\mathcal{N}_i^{(t)}}\right) + \tilde{L}_i^F\left(x^{(T)}, \{x_j^{(T)}\}_{j\in\mathcal{N}_i^{(T)}}\right)

where interaction terms are restricted to current neighbors. ND-iBR then replaces JiJ_i with J~i\tilde{J}_i in the iε\varepsilon-BR routine, driving decentralized updates (Mai et al., 15 Nov 2025).

3. Distributed Algorithm and Termination Properties

The essential distributed ND-iBR scheme proceeds as follows (Mai et al., 15 Nov 2025):

  1. Neighbor Trajectory Exchange: Each agent ii at iteration kk collects nominal trajectories {u~jk}jNi(t)\{\tilde{u}_j^k\}_{j \in \mathcal{N}_i^{(t)}} of current neighbors.
  2. Local MPC-style Update: Agent ii solves

minuiJ~i(x(0),(ui,u~ik)),subject to dynamics and J~iinfviJ~iε,\min_{u_i} \tilde{J}_i(x^{(0)}, (u_i, \tilde{u}_{-i}^k)), \quad \text{subject to dynamics and} \ \tilde{J}_i - \inf_{v_i} \tilde{J}_i \geq \varepsilon,

typically via a general-purpose nonlinear solver (e.g., IPOPT).

  1. Improvement Assessment: Compute rik=Ji(uik,uik)Ji(ui,uik)r_i^k = J_i(u_i^k, u_{-i}^k) - J_i(u_i^\star, u_{-i}^k).
  2. Termination: If maxirik<ε\max_i r_i^k < \varepsilon, declare uku^k an ε\varepsilon-Nash equilibrium; else select the maximizing iki_k, set uk+1=(uik,uikk)u^{k+1} = (u_{i_k}^\star, u_{-i_k}^k), and repeat.

Termination within at most (Φ(u0)Φmin)/ε(\Phi(u^0) - \Phi_{\min}) / \varepsilon iterations is guaranteed, provided that each strategy set Ui\mathcal{U}_i is compact and cost functionals satisfy the DPG decomposition. The final profile uεu^\varepsilon satisfies the ε\varepsilon-Nash condition: Ji(uiε,uiε)infviJi(vi,uiε)+ε,i.J_i(u_i^\varepsilon, u_{-i}^\varepsilon) \leq \inf_{v_i} J_i(v_i, u_{-i}^\varepsilon) + \varepsilon, \quad \forall i.

4. ND-iBR in Networked Evolutionary Games

For 2×22 \times 2 symmetric games on random graphs G(n,p)G(n,p), ND-iBR specializes to a local update wherein each vertex vv selects the strategy maximizing its payoff against its neighbors’ current choices. The update reduces to generalized majority or minority dynamics, parameterized by payoff-skew λ\lambda,

λ=q1,1q0,1q0,0q1,0,\lambda = \frac{q_{1,1} - q_{0,1}}{q_{0,0} - q_{1,0}},

dictated by the payoff matrix QQ (Chellig et al., 2020). In the majority regime, vertices synchronize to the local majority strategy, and, for sufficiently dense graphs (pn1/2p \gg n^{-1/2}), the system attains unanimity in at most four synchronous rounds. In the skewed (λ1\lambda \neq 1) case, critical connectivity thresholds pc(λ)p_c(\lambda) precisely characterize when the largest component achieves consensus; below threshold, discordant “blocking stars” persistently impede full synchronization.

5. Computational Complexity and Scalability

ND-iBR leverages locality to realize substantial reductions in per-iteration computational burden. With all-to-all coupling, each agent’s subproblem has O(N)O(N) constraints, yielding total coupling O(N2)O(N^2). Restricting to local neighborhoods, the coupling reduces to O(NNi)O(N \cdot \overline{|\mathcal{N}_i|}), where empirical studies find NiN\overline{|\mathcal{N}_i|} \ll N and growing sublinearly as agent population increases (e.g., 0.75.50.7 \rightarrow 5.5 as N=315N=3 \rightarrow 15) (Mai et al., 15 Nov 2025). This yields 2–3×\times reductions in effective coupling and commensurate improvements in solver runtime.

6. Empirical Performance in Multi-Agent Motion Planning

Monte Carlo experiments for multi-agent quadrotor navigation under stochastic disturbances demonstrate that ND-iBR within the Reachability-Enhanced Dynamic Potential Game (RE-DPG) framework maintains stable tracking cost across agents (NN) and noise (σ\sigma), outperforming probabilistic iLQR and DRL baselines in time efficiency and collision avoidance (Mai et al., 15 Nov 2025). Results indicate

  • Consistently low tracking costs across varying NN and σ\sigma
  • Near-zero residual distance to goals within horizon, matching or improving upon baselines
  • Zero collision incidence in all tested scenarios (with MA-FRS enabled); ablation of MA-FRS yields collisions at higher NN or σ\sigma
  • Average per-agent MA-FRS propagation 100\approx 100 ms, ND-iBR per step <0.2< 0.2 s, validating real-time feasibility for receding-horizon planning

A plausible implication is that ND-iBR, especially when combined with explicit uncertainty propagation (via MA-FRS), achieves robust, scalable, and safe coordination in dynamic, highly interactive environments.

7. Theoretical Limits and Network Effects

In dense interaction graphs, ND-iBR rapidly stabilizes collective strategies under broad classes of payoff matrices. However, below critical network connectivity, persistent local dissent arises, fully characterized via "blocking" substructures such as (,k)(\ell,k)-blocking stars. The frequency and persistence of such structures are quantified in terms of random graph parameters and Poisson statistics at threshold regimes (Chellig et al., 2020). This demonstrates the sensitivity of ND-iBR convergence both to local coupling rules and the underlying interaction topology.

Slide Deck Streamline Icon: https://streamlinehq.com

Whiteboard

Forward Email Streamline Icon: https://streamlinehq.com

Follow Topic

Get notified by email when new papers are published related to Neighborhood-Dominated iterative Best Response (ND-iBR).