Papers
Topics
Authors
Recent
Search
2000 character limit reached

Bussgang-Aided KalmanNet

Updated 31 May 2026
  • The paper introduces a framework combining analytic Bussgang decomposition with deep recurrent neural networks to achieve high-accuracy state estimation under severe 1-bit quantization.
  • It employs a Bussgang-aided Kalman filter and its reduced-complexity variant, significantly reducing computational cost without major RMSE performance loss.
  • The architecture leverages GRU modules to adaptively learn Kalman gains, enhancing robustness against model mismatches and non-Gaussian quantization distortions.

Bussgang-aided KalmanNet is a state estimation framework designed for high-accuracy filtering in systems with 1-bit quantized observations and possibly imperfect process and measurement models. The approach combines analytic linearization based on the Bussgang decomposition with deep recurrent neural networks to mitigate the severe information loss induced by 1-bit quantization and to compensate for model mismatch. The algorithm extends traditional Kalman and hybrid filtering methodologies, providing robustness and computational efficiency in settings where quantization distortion and non-idealities are prominent (Jung et al., 23 Jul 2025).

1. System Model and 1-Bit Quantization

The foundation of Bussgang-aided KalmanNet is a discrete-time nonlinear state-space model: xt=f(xt1)+wt,wtN(0,Qt)x_t = f(x_{t-1}) + w_t,\qquad w_t\sim\mathcal{N}(0,Q_t)

yt=h(xt)+vt,vtN(0,Rt)y_t = h(x_t) + v_t,\qquad v_t\sim\mathcal{N}(0,R_t)

where xtRmx_t\in\mathbb{R}^m represents the latent state, and ytRny_t\in\mathbb{R}^n is the pre-quantized sensor measurement. Gaussian process noise wtw_t and measurement noise vtv_t are assumed mutually independent. Subsequently, the real-valued measurement is quantized by a 1-bit element-wise quantizer—possibly employing a dithering technique based on predictive centering: zt=ytτt,τt=y^tt1z_t = y_t - \tau_t, \quad \tau_t = \hat y_{t|t-1}

rt=Q(zt),[Q(z)]i={+1,zi>0 1,zi0r_t = Q(z_t), \quad [Q(z)]_i = \begin{cases} +1,& z_i > 0 \ -1,& z_i \le 0 \end{cases}

The estimator aims to minimize mean squared state reconstruction error given binary observations: x^tt=argminx^tE[xtx^t2r1,...,rt]\hat x_{t|t} = \arg\min_{\hat x_t}\,\mathbb{E}\bigl[\|x_t-\hat x_t\|^2\,|\,r_{1},...,r_{t}\bigr] 1-bit quantization, especially under non-dithered or asymmetric regimes, introduces significant distortion and non-Gaussianity, invalidating most classic Kalman-type solutions without substantial modification.

2. Bussgang Decomposition and Linear Surrogate

The Bussgang theorem is leveraged under the assumption that the quantizer input ztz_t is (approximately) zero-mean Gaussian. The decomposition establishes a linear relationship between quantizer output and input through: yt=h(xt)+vt,vtN(0,Rt)y_t = h(x_t) + v_t,\qquad v_t\sim\mathcal{N}(0,R_t)0

yt=h(xt)+vt,vtN(0,Rt)y_t = h(x_t) + v_t,\qquad v_t\sim\mathcal{N}(0,R_t)1

yielding the surrogate linear model: yt=h(xt)+vt,vtN(0,Rt)y_t = h(x_t) + v_t,\qquad v_t\sim\mathcal{N}(0,R_t)2 Here, yt=h(xt)+vt,vtN(0,Rt)y_t = h(x_t) + v_t,\qquad v_t\sim\mathcal{N}(0,R_t)3 is an additive, zero-mean, uncorrelated distortion with covariance: yt=h(xt)+vt,vtN(0,Rt)y_t = h(x_t) + v_t,\qquad v_t\sim\mathcal{N}(0,R_t)4

yt=h(xt)+vt,vtN(0,Rt)y_t = h(x_t) + v_t,\qquad v_t\sim\mathcal{N}(0,R_t)5

This conversion enables 1-bit quantized observations to be processed using a Kalman-like linear update.

3. Bussgang-aided Kalman Filter and Reduced Complexity Variant

The Bussgang-aided Kalman Filter (BKF) propagates predictions as in the extended Kalman filter, but uses the linearized Bussgang measurement model for updates:

  • Prediction:

    yt=h(xt)+vt,vtN(0,Rt)y_t = h(x_t) + v_t,\qquad v_t\sim\mathcal{N}(0,R_t)6

    yt=h(xt)+vt,vtN(0,Rt)y_t = h(x_t) + v_t,\qquad v_t\sim\mathcal{N}(0,R_t)7

  • Update with the Bussgang gain:

    yt=h(xt)+vt,vtN(0,Rt)y_t = h(x_t) + v_t,\qquad v_t\sim\mathcal{N}(0,R_t)8

    yt=h(xt)+vt,vtN(0,Rt)y_t = h(x_t) + v_t,\qquad v_t\sim\mathcal{N}(0,R_t)9

    xtRmx_t\in\mathbb{R}^m0

The update treats xtRmx_t\in\mathbb{R}^m1 as a noisy, linear measurement.

For high-dimensional observations, an order-of-magnitude computational reduction is achieved using the reduced-complexity variant (rBKF). This introduces a projection matrix xtRmx_t\in\mathbb{R}^m2 (block averages, etc.), forming: xtRmx_t\in\mathbb{R}^m3

xtRmx_t\in\mathbb{R}^m4

All update equations use these projected quantities, reducing matrix inversion cost from xtRmx_t\in\mathbb{R}^m5 to xtRmx_t\in\mathbb{R}^m6.

4. Bussgang-aided KalmanNet Architecture

To address imperfect or partially known process/measurement models, Bussgang-aided KalmanNet (BKNet) parameterizes the gain computation as a learned function: xtRmx_t\in\mathbb{R}^m7

xtRmx_t\in\mathbb{R}^m8

The gain module xtRmx_t\in\mathbb{R}^m9 is realized by a sequence of three gated recurrent unit (GRU) blocks:

  • GRUytRny_t\in\mathbb{R}^n0: captures dependency on process noise ytRny_t\in\mathbb{R}^n1
  • GRUytRny_t\in\mathbb{R}^n2: models prior state covariance ytRny_t\in\mathbb{R}^n3
  • GRUytRny_t\in\mathbb{R}^n4: encodes measurement covariance and quantizer nonlinearity

Inputs incorporate estimated state differences (ytRny_t\in\mathbb{R}^n5), residual differences (ytRny_t\in\mathbb{R}^n6), and matrix embeddings via fully-connected layers. A final fully-connected layer maps GRU outputs to the gain estimate. This modular, recurrent design preserves interpretability and low parameter count while enabling data-driven adaptation to unknown or non-Gaussian regimes.

5. Training Methodology and Pseudocode

BKNet employs a mean-square error loss with ytRny_t\in\mathbb{R}^n7 regularization: ytRny_t\in\mathbb{R}^n8 Training uses the Adam optimizer with a learning rate of ytRny_t\in\mathbb{R}^n9, batch size wtw_t0, and regularization coefficient wtw_t1. Two types of data are used:

  • Simulated: Lorenz attractor (sequence length 100, Gaussian noise, dithering quantization)
  • Real-world: NCLT robot dataset (GPS/odometry, 6-dimensional state, 2-dimensional measurement, 1 Hz sampling; train/test on separate days)

Dithering centers the quantizer inputs at current predictions (wtw_t2), improving symmetry and Fisher information.

The update procedure follows the pseudocode:

vtv_t4 with wtw_t3 and wtw_t4.

6. Empirical Results and Performance

Key measured results:

Scenario RMSE (dB) Notes
Lorenz, EKF/KalmanNet (ideal) wtw_t5
Lorenz, EKF/KalmanNet (1-bit) fail (wtw_t612–18) Unable to track accurately
Lorenz, BKF/rBKF (1-bit) wtw_t7 Analytic Bussgang handling
Lorenz, BKNet (1-bit, mismatched models) wtw_t8 Robust to model errors, Taylor expansions, rotational mismatches
NCLT, KalmanNet (ideal) wtw_t9 Real robot data
NCLT, EKF/KalmanNet (1-bit) fail State estimation fails under strong quantization
NCLT, BKF (1-bit) vtv_t0 High accuracy under 1-bit quantization
NCLT, BKNet (1-bit) vtv_t1 Matches KalmanNet with ideal (unquantized) data

Reduced-complexity BKF with vtv_t2 ADCs at the sensor achieves a runtime decrease by a factor of 4–10 compared to the full BKF with negligible RMSE increase.

7. Theoretical Foundations and Interpretational Insights

Crucial to the performance of Bussgang-aided KalmanNet are several theoretical conditions:

  • Centering quantizer inputs via dithering is critical, ensuring a computable diagonal structure for Bussgang gains and maximizing information extraction.
  • For (near-)Gaussian residuals, the Bussgang decomposition ensures that the linear surrogate maintains second-order statistics of the original quantized system. Thus, Kalman recursions yield (approximately) optimal LMMSE state estimates.
  • The residual noise vtv_t3 is approximated as Gaussian, with empirical justification in cases of moderate distortion or when large numbers of parallel 1-bit channels are aggregated.
  • Within the BKNet extension, GRU modules flexibly adapt to biases, higher-order nonlinearities, and mismatches. The universal approximation capabilities of recurrent neural networks, under sufficient data and capacity, guarantee that the true time-varying gains can be learned.

Together, the Bussgang-aided Kalman filter and its neural extension KalmanNet form a principled, modular, and data-adaptive framework for nonlinear state estimation under severe quantization, robust to model errors and suitable for both simulated and real-world settings (Jung et al., 23 Jul 2025).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Bussgang-aided KalmanNet.