Papers
Topics
Authors
Recent
2000 character limit reached

Acceleration-Based Exponential CBFs (A–ECBFs)

Updated 18 December 2025
  • A–ECBFs are safety-critical control methods that enforce exponential decay on barrier function second derivatives to maintain safe sets.
  • They utilize quadratic programming with slack variables to adjust control inputs and integrate these constraints seamlessly into deep learning frameworks.
  • Automatic gain tuning via neural networks enables A–ECBFs to adapt to heterogeneous environments and optimize safety-performance trade-offs.

Acceleration-Based Exponential Control Barrier Functions (A–ECBFs) represent an advancement in safety-critical control theory, extending the framework of Control Barrier Functions (CBFs) to systems characterized by a relative degree of two in their safety outputs. This construction shapes the second derivative of a safety barrier function using exponential decay gains, thereby enabling enforcement of forward invariance for a safe set through quadratic programming constraints. Recent work has demonstrated methods to embed these constraints within differentiable deep learning architectures, facilitating generalization to novel environments and automatic adaptation of safety-performance trade-offs (Ma et al., 2022).

1. Control-Affine Systems and Relative Degree Two

The formal setting for A–ECBFs consists of a general nonlinear, control-affine dynamical system:

x˙=f(x)+g(x)u,xRn, uRm,\dot{x} = f(x) + g(x)u,\qquad x \in \mathbb{R}^n,\ u \in \mathbb{R}^m,

where h(x)h(x) is a twice-differentiable function termed the “safety output”. The forward-invariant set is defined as

C={xh(x)0}.C = \{ x \mid h(x) \geq 0 \}.

For h(x)h(x) with relative degree two, the input uu appears explicitly only in the second derivative h¨(x)\ddot{h}(x). The first derivative, given by Lfh(x)L_f h(x), is independent of uu, while the second derivative,

h¨(x)=Lf2h(x)+LgLfh(x)u,\ddot{h}(x) = L_f^2 h(x) + L_g L_f h(x) u,

provides the direct locus for control action.

2. Exponential Barrier Conditions for Relative-Degree-Two Outputs

The core principle of A–ECBFs is to impose an accelerated, exponential error decay condition coupling h(x)h(x), h˙(x)\dot{h}(x), and h¨(x)\ddot{h}(x). For positive gains k1,k0>0k_1, k_0 > 0, the constraint

h¨(x)+k1h˙(x)+k0h(x)0\ddot{h}(x) + k_1 \dot{h}(x) + k_0 h(x) \geq 0

defines the acceleration-based ECBF condition. This formulation forces the joint state [h h˙][h\ \dot{h}]^\top to converge exponentially to the nonnegative region, thus ensuring invariance of the safe set: once h(x)0h(x)\geq0, it is maintained for all future time under compliant control. This exponential-type policy generalizes the use of class-K\mathcal{K} functions in traditional CBFs by expressing them as linear feedback on augmented barrier states.

3. Quadratic Programming Formulation and Slack Variables

Safety is operationalized by solving a constrained quadratic program (QP) that projects a user-specified, possibly unsafe control udesu_\text{des} onto the set of admissible controls preserving the A–ECBF. The standard formulation is

u,δ=argminu,δ uudes2+pδ2 s.t. h¨(x)+k1h˙(x)+k0h(x)δ,\begin{align*} u^*, \delta^* = \arg\min_{u,\,\delta} &\ \|u - u_\text{des}\|^2 + p\,\delta^2 \ \text{s.t.} &\ \ddot{h}(x) + k_1 \dot{h}(x) + k_0 h(x) \geq -\delta, \end{align*}

where δ0\delta \geq 0 is a slack variable ensuring QP feasibility, and p1p \gg 1 severely penalizes safety violations. At test or deployment time, setting δ0\delta \to 0 restores a hard safety constraint.

4. Gain Selection and Class K\mathcal{K} Functions

Gain selection critically determines the performance-safety profile. For the augmented barrier state

ηb=[h h˙],\eta_b = \begin{bmatrix} h \ \dot{h} \end{bmatrix},

the closed-loop system is

η˙b=[01 k0k1]ηb+[0 1]μ,μ=Lf2h+LgLfhu.\dot{\eta}_b = \begin{bmatrix} 0 & 1 \ -k_0 & -k_1 \end{bmatrix} \eta_b + \begin{bmatrix} 0 \ 1 \end{bmatrix} \mu, \quad \mu = L_f^2 h + L_g L_f h\, u.

A sufficient condition for exponential decay is that the eigenvalues of the dynamics matrix are negative and well separated. Placing them at p1-p_1 and p2-p_2 yields

k1=p1+p2,k0=p1p2.k_1 = p_1 + p_2,\qquad k_0 = p_1 p_2.

This mirrors an exponential class-K\mathcal{K} function on [h,h˙][h,\,\dot{h}], ensuring rapid convergence.

5. Differentiable QP Embedding in Deep Learning Architectures

The ECBF-QP constraint, being convex, can be embedded as a differentiable layer within a neural network controller. Viewed as

minu 12uHu+c(x)us.t.A(x)ub(x),\min_u \ \frac{1}{2} u^\top H u + c(x)^\top u \quad \text{s.t.} \quad A(x) u \geq b(x),

the optimal control mapping u(x)u^*(x) is differentiable almost everywhere, as established by differentiating the Karush–Kuhn–Tucker (KKT) conditions. Efficient routines for this differentiation are available in libraries such as OptNet and through the method of Amos & Kolter. This architecture enables gradient-based end-to-end training across the QP, allowing system and control gains to adapt via backpropagation to trajectory-level loss signals (Ma et al., 2022).

6. Illustrative Example: 2D Double Integrator with Obstacle Avoidance

In the double integrator case (x=[y;v]R4x = [y; v] \in \mathbb{R}^4), let uu represent acceleration. For elliptical obstacle avoidance with center ycy_c and matrix QQ,

h(x)=(yyc)Q(yyc)1h(x) = (y - y_c)^\top Q (y - y_c) - 1

defines the safety output. Derivatives are

h˙=2(yyc)Qv,h¨=2vQv+2(yyc)Qu.\dot{h} = 2(y - y_c)^\top Q v,\qquad \ddot{h} = 2 v^\top Q v + 2(y - y_c)^\top Q u.

The A–ECBF constraint translates to:

2vQv+2(yyc)Qu+k12(yyc)Qv+k0[(yyc)Q(yyc)1]0.2 v^\top Q v + 2(y - y_c)^\top Q u + k_1 \cdot 2(y - y_c)^\top Q v + k_0 [ (y - y_c)^\top Q (y - y_c) - 1 ] \geq 0.

Solving the associated QP enforces collision-free adjustment to the nominal acceleration command.

7. Generalization via Automatic Gain Tuning

Manual tuning of gains is often impractical for heterogeneous environments. A learned “Λ-net” neural network can map environment descriptors (e.g., obstacle features) and initial states to suitable eigenvalues (p1,p2)(p_1, p_2), which define the gains (k1,k0k_1, k_0) for each new scenario. Offline, a trajectory-loss (e.g., total path length or control effort) may be minimized by differentiating through the QP, yielding near-optimal obstacle avoidance across diverse test settings. This approach eliminates the need for per-environment CBF hyperparameter tuning, and empirical results indicate robust generalization of safety policies in randomized environments (Ma et al., 2022).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Whiteboard

Topic to Video (Beta)

Follow Topic

Get notified by email when new papers are published related to Acceleration-Based Exponential Control Barrier Functions (A–ECBFs).