Interacting Multiple Model Proprioceptive Odometry for Legged Robots

Published 31 Mar 2026 in cs.RO | (2603.29383v1)

Abstract: State estimation for legged robots remains challenging because legged odometry generally suffers from limited observability and therefore depends critically on measurement constraints to suppress drift. When exteroceptive sensors are unreliable or degraded, such constraints are mainly derived from proprioceptive measurements, particularly contact-related leg kinematics information. However, most existing proprioceptive odometry methods rely on an idealized point-contact assumption, which is often violated during real locomotion. Consequently, the effectiveness of proprioceptive constraints may be significantly reduced, resulting in degraded estimation accuracy. To address these limitations, we propose an interacting multiple model (IMM)-based proprioceptive odometry framework for legged robots. By incorporating multiple contact hypotheses within a unified probabilistic framework, the proposed method enables online mode switching and probabilistic fusion under varying contact conditions. Extensive simulations and real-world experiments demonstrate that the proposed method achieves superior pose estimation accuracy over state-of-the-art methods while maintaining comparable computational efficiency.

Abstract PDF Upgrade to Chat

Authors (5)

Summary

The paper introduces an IMM-based proprioceptive odometry framework that integrates rolling-aware error-state Kalman filtering for robust state estimation under diverse contact conditions.
It fuses IMU and joint encoder data with adaptive mode probabilities to accurately handle rolling and slipping contacts, reducing drift significantly.
Evaluation in simulation and real-world trials shows over 40% error reduction and superior performance compared to conventional and learning-based filters.

IMM-Based Proprioceptive Odometry for Legged Robots: Formulation, Implementation, and Evaluation

Motivation and Problem Formulation

Legged robots necessitate robust state estimation under diverse and unstructured environments. Proprioceptive odometry (PO), primarily leveraging IMUs and joint encoders, serves as the core solution when exteroceptive sensors are compromised. Conventional PO methodologies rely on the static point-contact assumption, rendering legged odometry vulnerable to drift and degraded accuracy in real-world locomotion, particularly under dynamic terrain interactions such as foot rolling and slippage. The presented work addresses this limitation by proposing an interacting multiple model (IMM)-based PO framework capable of online contact mode inference and robust fusion.

Figure 1: Illustration of rolling and slippery contacts. As the robot advances, foot-ground interaction transitions between rolling and slip dynamics.

The primary innovation lies in explicitly modeling the rolling contact—characterized by deterministic, nonzero stance foot velocity—while simultaneously handling unpredictable slippage via mode-dependent process noise augmentation. The state estimator, based on rolling-aware error-state Kalman filtering (ESKF), is integrated within an IMM architecture, enabling probabilistic adaptation and fusion of multiple contact hypotheses.

Rolling-Aware State Estimation Architecture

The complete pipeline instantiates dual parallel ESKF modules, corresponding to rolling and slippery contact models, and fuses their estimates via mode probabilities determined through likelihood evaluation. The estimator state is augmented with foot velocities to directly capture structured contact-induced motion deviations. The mode probabilities and fused state, updated in real-time, allow accurate pose estimation even under abrupt transitions or terrain irregularities.

Figure 2: Overview of the proposed IMM-based proprioceptive state estimation, fusing IMU and encoder data across parallel models for rolling and slip dynamics.

Key elements include:

Sensor fusion: IMU and joint encoder measurements.
Rolling-aware update: Explicit constraint through augmented foot velocity state.
Mode switching: Process noise covariance inflation for slip scenarios, performed within the IMM inference loop.
Efficient error-state representation: Orientation errors managed via rotation vectors, aligning with manifold-based filtering approaches.

Experimental Validation: Simulation and Real-World Scenarios

The framework was evaluated in Gazebo-based simulation and on the AlienGo quadruped robot across a spectrum of conditions—flat, slippery, uneven, and slope terrains. Snapshots illustrate the complexity of tested environments.

Figure 3: Snapshots of four simulation scenarios, covering flat, slippery, uneven, and slope terrains.

Simulation Performance

Straight line trajectories on flat terrain demonstrate that rolling-aware estimation (ESKF-R) reduces position error (ATE/RPE) by >40% compared to invariant EKF (IEKF). IMM-PO further decreases errors by >17%. In circular and sinusoidal scenarios, IMM-PO achieves up to 70% reduction in position ATE over IEKF and substantial attitude error improvements.

Figure 4: Comparison of estimated and ground truth trajectories; nonzero foot velocity profiles corroborate the rolling contact assumption.

In slippery scenarios, IMM-PO effectively mitigates drift—ATE reduced almost 70% relative to conventional models. On uneven and slope terrains, IMM-PO's adaptive fusion maintains minimized drift even as rolling models are only approximations. Mode probabilities track the contact phase, evidencing adaptive switching (Figure 5).

Figure 6: Estimated trajectories and attitude errors in the slippery scenario, showing IMM-PO's superior drift correction.

Figure 5: Mode switching dynamics in IMM-PO correlate with ground truth foot velocity changes, reflecting responsiveness to slip events.

Real-World Results

Two scenarios (flat and complex terrain) were carried out indoors with high-precision motion capture as ground truth. In both sequences, IMM-PO outperformed all baselines, including learning-based KalmanNet, delivering the smallest position and attitude drift. KalmanNet exhibited local consistency but large global error accumulation over extended sequences.

Figure 7: Snapshots from real-world experiments on flat and complex terrain.

Figure 8: Estimated trajectories for flat and complex terrain; IMM-PO maintains tight congruence with ground truth.

Figure 9: KalmanNet's XY trajectory divergence over long horizons demonstrates the necessity for robust probabilistic adaptation.

Ablation Study and Computational Considerations

A rigorous ablation analysis in slippery conditions confirms the value of dynamic contact modeling. Fixed-noise estimators (ESKF-R, ESKF-L) exhibit inadequate adaptation to slip, yielding higher ATE/RPE. IMM-PO (two-mode realization) achieves optimal accuracy with only a marginal computational overhead (~1ms/step), significantly outperforming non-interactive IMM and three-mode IMM variants (IMM-T). The adaptive mode probabilities consistently respond to foot velocity disturbances during both sustained slip and transient contact events.

KalmanNet Baseline: Limitations and Training Strategy

KalmanNet, a neural network-aided filter, struggles with high-dimensional, contact-switching environments. Enhanced training (curriculum learning, eigenvalue regularization) improves short-term performance but global trajectory consistency remains deficient due to recursive stability issues—evident by the spectral radius analysis.

Figure 10: Closed-loop stability indicator demonstrating deteriorating filter stability over time for KalmanNet on long sequences.

Theoretical and Practical Implications

The explicit incorporation of foot velocity and contact dynamics within a probabilistic filtering architecture advances PO robustness against slippage, contact uncertainty, and terrain-induced disturbances. This methodology yields improved estimator consistency without significant computational penalty, enabling real-time applicability. The approach further underscores the superiority of model-informed probabilistic adaptation versus heuristic or learned thresholding, especially under nonstationary contact dynamics.

Practically, the framework enables reliable PO for legged robots in environments devoid of exteroceptive signals—a crucial requirement for navigation in occluded, reflective, or degraded settings. Theoretically, extending IMM filtering to non-ideal contact models paves the way for advanced inference in switching hybrid dynamical systems, broadening the horizon for adaptive state estimation methodologies in robotics.

Conclusion

The proposed IMM-based proprioceptive odometry provides accurate, robust state estimation for legged robots under dynamic foot-ground interaction, explicitly modeling rolling and slip-induced uncertainties. Empirical results in simulation and real robotics deployments validate the superiority of adaptive multi-model inference over conventional or purely learning-based baselines. The method balances estimation accuracy, robustness, and computational efficiency, establishing a new paradigm for proprioceptive filtering in complex terrain and contact scenarios. Further development should explore finer mode discretizations, integration with exteroceptive cues, and application to whole-body humanoid state estimation architectures.

Markdown Report Issue