Hybrid Qf-ES-EKF/UKF: Efficient VIO Fusion
- The paper introduces a hybrid sensor fusion filter that integrates ESKF for full state propagation with a selective SUKF for quaternion-based orientation refinement.
- This method achieves high orientation accuracy while reducing computational cost by over 10× compared to a full SUKF, making it ideal for real-time UAV applications.
- Adaptive visual measurement covariance further enhances robustness by dynamically tuning noise models based on real-time image quality metrics, significantly improving rotation and position estimates.
A Hybrid Quaternion-focused Error-State EKF/UKF (Qf-ES-EKF/UKF) is a sequential sensor fusion technique that combines the computational efficiency of the Error-State Extended Kalman Filter (ESKF) for global state propagation with the high-fidelity nonlinear estimation of a Scaled Unscented Kalman Filter (SUKF) applied selectively to a rotation-critical subspace, specifically the orientation quaternion. The architecture was introduced for robust, adaptive visual-inertial odometry (VIO) in Unmanned Aerial Vehicles (UAVs), with mechanisms for dynamic sensor reliability assessment and measurement noise adaptation to enhance pose estimation under challenging conditions (Asil et al., 19 Dec 2025).
1. State and Error-State Representations
The Qf-ES-EKF/UKF framework utilizes a 16-dimensional nominal state vector:
- Quaternion (4 × 1, body-to-world orientation)
- Velocity (world frame)
- Position (world frame)
- Accelerometer bias
- Gyroscope bias
In vector form:
The associated 15-dimensional error-state vector is:
- Attitude error (so(3) vector)
- Velocity error
- Position error
- Accelerometer bias error
- Gyroscope bias error
With
State correction and composition employ the “ ⊕ ” operator for quaternion error injection:
2. Error-State EKF Propagation
The core propagation leverages ESKF for the full state. The continuous-time nominal dynamics are:
Here, is the quaternion multiplication matrix, and is the rotation matrix from .
The 15-dimensional error-state is propagated with a linear time-varying system:
Discretization uses the Van-Loan method on a block matrix to compute the state-transition matrix and discrete process noise .
The ESKF propagation step at each IMU sample is:
3. Quaternion-Focused SUKF Orientation Refinement
After the global ESKF propagation, a targeted SUKF refinement step is applied solely to the orientation-error covariance block . The process is as follows:
- Generate sigma points in the orientation subspace () using scaled unscented transform parameters .
- Compute Cholesky decomposition , define , and construct .
- Map each sigma point to quaternion perturbations via and apply to the nominal quaternion.
- Each perturbed quaternion is propagated through IMU integration for orientation only.
- Retract propagated quaternions to the nominal orientation using the log map, yielding error vectors .
- Refine using the sigma-point statistics:
- Replace original in the error covariance with , leaving other submatrices unchanged.
4. Sequential Hybrid Propagation and Update
The complete Qf-ES-EKF/UKF filter loop includes:
- ESKF predict for full nominal state and error covariance.
- Extract and refine with SUKF in the quaternion subspace.
- When a visual or zero-velocity measurement becomes available, update the full error state with the standard EKF measurement update using measurement Jacobian and noise .
- Inject estimated error into the nominal state and reset the error state.
This sequence achieves computational savings by limiting the costly SUKF step to a three-dimensional subspace while maintaining full ESKF update for all state variables.
5. Adaptive Visual Measurement Covariance
Measurement noise adaptation is governed by a dynamic sensor confidence score derived from real-time image quality metrics, including:
- Inverted (normalized) entropy, normalized blur, pose-chi error, culled keyframes ratio for position covariance ()
- Derivative metrics (intensity, blur, chi, keyframes) for velocity covariance ()
A combined utility measure ( or ) is calculated as the maximum among individual metrics (static or dynamic), transformed through a CASEF activation function:
Covariance values are selected using a thresholding and interpolation scheme based on confidence scores, ensuring higher noise is assigned to outlier or low-confidence measurements.
The visual measurement covariance in the EKF update is then:
6. Computational Complexity
The computational complexity is as follows:
| Method | Complexity | Steps Included |
|---|---|---|
| Full ESKF | Full state prediction | |
| Full SUKF (16D) | Sigma point expansion | |
| Qf-ES-EKF/UKF (hybrid, 3D UKF) | ESKF + 3D SUKF |
With , , the hybrid yields 252 operations vs. 3375 for full SUKF—a more than reduction in state-propagation step. This enables real-time VIO on UAVs where high-fidelity orientation estimation is critical, without the prohibitive cost of applying a full SUKF to the complete state.
7. Experimental Results and Performance
Evaluation was performed on the EuRoC MAV (MH01–MH05) dataset:
- Rotation accuracy (quaternion RMSE, degrees, avg. over all sequences):
- ESKF: 1.42°
- SUKF: 0.85°
- Qf-ES-EKF/UKF (no adaptive covariance): 0.74°
- Adaptive Qf-ES-EKF/UKF: 0.61°
- Position accuracy (ATE, meters; MH04 + MH05):
- ESKF: 0.56 m
- SUKF: 0.56 m
- Qf-ES-EKF/UKF: 0.53 m
- Adaptive Qf-ES-EKF/UKF: 0.26 m
- Timing (filter only, CPU, s):
- ESKF: 2.44 s (RTF0.92)
- SUKF: 147.2 s (RTF0.092)
- Qf-ES-EKF/UKF: 76.9 s (RTF0.175)
The hybrid approach achieves SUKF-level orientation accuracy at approximately 48% lower computational cost compared to a full SUKF, and only %%%%6970%%%% slower than plain ESKF. This attests to its suitability for real-time UAV VIO, especially in scenarios that are rotation-critical and subject to variable sensor reliability (Asil et al., 19 Dec 2025).