AI-Aided Kalman Filters (2410.12289v3)
Abstract: The Kalman filter (KF) and its variants are among the most celebrated algorithms in signal processing. These methods are used for state estimation of dynamic systems by relying on mathematical representations in the form of simple state-space (SS) models, which may be crude and inaccurate descriptions of the underlying dynamics. Emerging data-centric AI techniques tackle these tasks using deep neural networks (DNNs), which are model-agnostic. Recent developments illustrate the possibility of fusing DNNs with classic Kalman-type filtering, obtaining systems that learn to track in partially known dynamics. This article provides a tutorial-style overview of design approaches for incorporating AI in aiding KF-type algorithms. We review both generic and dedicated DNN architectures suitable for state estimation, and provide a systematic presentation of techniques for fusing AI tools with KFs and for leveraging partial SS modeling and data, categorizing design approaches into task-oriented and SS model-oriented. The usefulness of each approach in preserving the individual strengths of model-based KFs and data-driven DNNs is investigated in a qualitative and quantitative study, whose code is publicly available, illustrating the gains of hybrid model-based/data-driven designs. We also discuss existing challenges and future research directions that arise from fusing AI and Kalman-type algorithms.
- R. E. Kalman, “A new approach to linear filtering and prediction problems,” Journal of Basic Engineering, vol. 82, no. 1, pp. 35–45, 1960.
- S. F. Schmidt, “The Kalman filter-its recognition and development for aerospace applications,” Journal of Guidance and Control, vol. 4, no. 1, pp. 4–7, 1981.
- M. S. Grewal and A. P. Andrews, “Applications of Kalman filtering in aerospace 1960 to the present [historical perspectives],” IEEE Control Syst. Mag., vol. 30, no. 3, pp. 69–78, 2010.
- S. Gannot and A. Yeredor, “The Kalman filter,” Springer Handbook of Speech Processing, pp. 135–160, 2008.
- P. Becker, H. Pandya, G. Gebhardt, C. Zhao, C. J. Taylor, and G. Neumann, “Recurrent Kalman networks: Factorized inference in high-dimensional deep feature spaces,” in International Conference on Machine Learning, 2019, pp. 544–552.
- R. Krishnan, U. Shalit, and D. Sontag, “Structured inference networks for nonlinear state space models,” in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 31, no. 1, 2017.
- S. Kuutti, R. Bowden, Y. Jin, P. Barber, and S. Fallah, “A survey of deep learning applications to autonomous vehicle control,” IEEE Trans. Intell. Transp. Syst., vol. 22, no. 2, pp. 712–733, 2020.
- J. Chen and X. Ran, “Deep learning with edge computing: A review,” Proc. IEEE, vol. 107, no. 8, pp. 1655–1674, 2019.
- N. Shlezinger, J. Whang, Y. C. Eldar, and A. G. Dimakis, “Model-based deep learning,” Proc. IEEE, vol. 111, no. 5, pp. 465–499, 2023.
- A. Klushyn, R. Kurle, M. Soelch, B. Cseke, and P. van der Smagt, “Latent matters: Learning deep state-space models,” Advances in Neural Information Processing Systems, vol. 34, pp. 10 234–10 245, 2021.
- H. Coskun, F. Achilles, R. DiPietro, N. Navab, and F. Tombari, “Long short-term memory Kalman filters: Recurrent neural estimators for pose regularization,” in Proceedings of the IEEE International Conference on Computer Vision, 2017, pp. 5524–5532.
- D. Gedon, N. Wahlström, T. B. Schön, and L. Ljung, “Deep state space models for nonlinear system identification,” IFAC-PapersOnLine, vol. 54, no. 7, pp. 481–486, 2021.
- T. Imbiriba, O. Straka, J. Duník, and P. Closas, “Augmented physics-based machine learning for navigation and tracking,” IEEE Trans. Aerosp. Electron. Syst., vol. 60, no. 3, pp. 2692–2704, 2024.
- G. Revach, N. Shlezinger, X. Ni, A. L. Escoriza, R. J. van Sloun, and Y. C. Eldar, “KalmanNet: Neural network aided Kalman filtering for partially known dynamics,” IEEE Trans. Signal Process., vol. 70, pp. 1532–1547, 2022.
- I. Buchnik, G. Revach, D. Steger, R. J. van Sloun, T. Routtenberg, and N. Shlezinger, “Latent-KalmanNet: Learned Kalman filtering for tracking from high-dimensional signals,” IEEE Trans. Signal Process., vol. 72, pp. 352–367, 2023.
- K. Pratik, R. A. Amjad, A. Behboodi, J. B. Soriaga, and M. Welling, “Neural augmentation of Kalman filter with hypernetwork for channel tracking,” in IEEE Global Communications Conference (GLOBECOM), 2021.
- V. G. Satorras, Z. Akata, and M. Welling, “Combining generative and discriminative models for hybrid inference,” in Advances in Neural Information Processing Systems, 2019, pp. 13 802–13 812.
- A. Ghosh, A. Honoré, and S. Chatterjee, “DANSE: Data-driven non-linear state estimation of model-free process in unsupervised learning setup,” IEEE Trans. Signal Process., vol. 72, pp. 1824–1838, 2024.
- G. Revach, N. Shlezinger, T. Locher, X. Ni, R. J. G. van Sloun, and Y. C. Eldar, “Unsupervised learned Kalman filtering,” in European Signal Processing Conference (EUSIPCO), 2022, pp. 1571–1575.
- S. N. Aspragkathos, G. C. Karras, and K. J. Kyriakopoulos, “Event-triggered image moments predictive control for tracking evolving features using UAVs,” IEEE Robot. Autom. Lett., vol. 9, no. 2, pp. 1019–1026, 2024.
- A. Milstein, G. Revach, H. Deng, H. Morgenstern, and N. Shlezinger, “Neural augmented Kalman filtering with Bollinger bands for pairs trading,” IEEE Trans. Signal Process., vol. 72, pp. 1974–1988, 2024.
- A. Gu, I. Johnson, K. Goel, K. Saab, T. Dao, A. Rudra, and C. Ré, “Combining recurrent, convolutional, and continuous-time models with linear state space layers,” Advances in neural information processing systems, vol. 34, pp. 572–585, 2021.
- N. Shlezinger and T. Routtenberg, “Discriminative and generative learning for linear estimation of random signals [lecture notes],” IEEE Signal Process. Mag., vol. 40, no. 6, pp. 75–82, 2023.
- L. Zhou, Z. Luo, T. Shen, J. Zhang, M. Zhen, Y. Yao, T. Fang, and L. Quan, “KFNet: Learning temporal camera relocalization using Kalman filtering,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 4919–4928.
- S. Jouaber, S. Bonnabel, S. Velasco-Forero, and M. Pilte, “NNAKF: A neural network adapted Kalman filter for target tracking,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, pp. 4075–4079.
- H. E. Rauch, F. Tung, and C. T. Striebel, “Maximum likelihood estimates of linear dynamic systems,” AIAA Journal, vol. 3, no. 8, pp. 1445–1450, 1965.
- A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, “Attention is all you need,” in Advances in Neural Information Processing Systems, 2017, pp. 5998–6008.
- A. Gu and T. Dao, “Mamba: Linear-time sequence modeling with selective state spaces,” arXiv preprint arXiv:2312.00752, 2023.
- A. Gu, K. Goel, and C. Ré, “Efficiently modeling long sequences with structured state spaces,” International Conference on Learning Representations (ICLR), 2022.
- N. Shlezinger, Y. C. Eldar, and S. P. Boyd, “Model-based deep learning: On the intersection of deep learning and optimization,” IEEE Access, vol. 10, pp. 115 384–115 398, 2022.
- A. H. Li, P. Wu, and M. Kennedy, “Replay overshooting: Learning stochastic latent dynamics with the extended Kalman filter,” in IEEE International Conference on Robotics and Automation (ICRA), 2021.
- L. Xu and R. Niu, “EKFNet: Learning system noise statistics from measurement data,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, pp. 4560–4564.
- G. Revach, X. Ni, N. Shlezinger, R. J. van Sloun, and Y. C. Eldar, “RTSNet: Learning to Smooth in Partially Known State-Space Models,” IEEE Trans. Signal Process., vol. 71, pp. 4441–4456, 2023.
- G. Choi, J. Park, N. Shlezinger, Y. C. Eldar, and N. Lee, “Split-KalmanNet: A robust model-based deep learning approach for state estimation,” IEEE Trans. Veh. Technol., vol. 72, no. 9, pp. 12 326–12 331, 2023.
- J. Wang, X. Geng, and J. Xu, “Nonlinear Kalman filtering based on self-attention mechanism and lattice trajectory piecewise linear approximation,” arXiv preprint arXiv:2404.03915, 2024.
- A. N. Putri, C. Machbub, D. Mahayana, and E. Hidayat, “Data driven linear quadratic Gaussian control design,” IEEE Access, vol. 11, pp. 24 227–24 237, 2023.
- Y. Dahan, G. Revach, J. Dunik, and N. Shlezinger, “Uncertainty quantification in deep learning based Kalman filters,” arXiv preprint arXiv:2309.03058, 2023.
- X. Ni, G. Revach, and N. Shlezinger, “Adaptive KalmanNet: Data-driven Kalman filter with fast adaptation,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024, pp. 5970–5974.
- A. Sakai and Y. Kuroda, “Discriminative parameter training of unscented Kalman filter,” IFAC Proceedings Volumes, vol. 43, no. 18, pp. 677–682, 2010, 5th IFAC Symposium on Mechatronic Systems.
- Z. Fan, D. Shen, Y. Bao, K. Pham, E. Blasch, and G. Chen, “RNN-UKF: Enhancing hyperparameter auto-tuning in unscented kalman filters through recurrent neural networks,” in 27th International Conference on Information Fusion, Venice, Italy, 2024.
- A. Chiuso and G. Pillonetto, “System identification: A machine learning perspective,” Annual Review of Control, Robotics, and Autonomous Systems, vol. 2, no. 1, pp. 281–304, 2019.
- L. Ljung, C. Andersson, K. Tiels, and T. B. Schön, “Deep learning and system identification,” IFAC-PapersOnLine, vol. 53, no. 2, pp. 1175–1181, 2020, 21st IFAC World Congress.
- J. Suykens, B. Moor, and J. Vandewalle, “Nonlinear system identification using neural state space models, applicable to robust control design,” International Journal of Control, vol. 62, no. 1, p. 129–152, 1995.
- D. Masti and A. Bemporad, “Learning nonlinear state–space models using autoencoders,” Automatica, vol. 129, pp. 0005–1098,, 2021.
- A. Gorji and M. Menhaj, “Identification of nonlinear state space models using an MLP network trained by the EM algorithm,” in 2008 IEEE International Joint Conference on Neural Networks, 2008, p. 53–60.
- Y. Wang, “A new concept using LSTM neural networks for dynamic system identification,” in 2017 American Control Conference (ACC, 2017, pp. 5324–5329,.
- M. Fraccaro, S. Sønderby, U. Paquet, and O. Winther, “Sequential neural models with stochastic layers,” in Advances in neural information processing systems, 2016, p. 29.
- F. Arnold and R. King, “State–space modeling for control based on physics-informed neural networks,” Engineering Applications of Artificial Intelligence, vol. 101, p. 104195, 2021. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S0952197621000427
- Y. Bao, J. M. Velni, A. Basina, and M. Shahbakhti, “Identification of state-space linear parameter-varying models using artificial neural networks,” IFAC-PapersOnLine,Volume, vol. 53, pp. 5286–5291, 2405–8963,, 2020.
- M. Schoukens, “Improved initialization of state-space artificial neural networks,” arXiv preprint arXiv:2103.14516, 2021.
- M. Forgione and D. Piga, “Model structures and fitting criteria for system identification with neural networks,” in IEEE International Conference Application of Information and Communication Technologies, 2020.
- S. Tang, T. Imbiriba, J. Duník, O. Straka, and P. Closas, “Augmented Physics-Based Models for High-Order Markov Filtering,” Sensors, vol. 24, no. 18, 2024. [Online]. Available: https://www.mdpi.com/1424-8220/24/18/6132
- T. Imbiriba, A. Demirkaya, J. Duník, O. Straka, D. Erdoğmuş, and P. Closas, “Hybrid neural network augmented physics-based models for nonlinear filtering,” in International Conference on Information Fusion (FUSION). IEEE, 2022.
- R. Krishnan, U. Shalit, and D. Sontag, “Deep Kalman filters,” arXiv preprint arXiv:1511.05121., 2015.
- D. Simon, “Kalman filtering with state constraints: a survey of linear and nonlinear algorithms,” IET Control Theory & Applications, vol. 4, no. 8, pp. 1303–1318, 2010.
- J. Humpherys, P. Redd, and J. West, “A fresh look at the kalman filter,” SIAM review, vol. 54, no. 4, pp. 801–823, 2012.
- A. Bemporad, “Recurrent neural network training with convex loss and regularization functions by extended kalman filtering,” IEEE Trans. Autom. Control, vol. 68, no. 9, pp. 5661–5668, 2023.
- J. Dunik, O. Straka, O. Kost, S. Tang, T. Imbiriba, and P. Closas, “Noise identification for data-augmented physics-based state-space models,” in Accepted for IEEE Workshop on Signal Processing Systems (SiPS 2024), Cambridge, MA, USA, 2024.
- L. Ljung, “Perspectives on system identification,” IFAC Proceedings Volumes, vol. 41, no. 2, pp. 7172–7184, 2008, 17th IFAC World Congress.
- D. Li, J. Zhou, and Y. Liu, “Recurrent-neural-network-based unscented Kalman filter for estimating and compensating the random drift of MEMS gyroscopes in real time,” Mechanical Systems and Signal Processing, vol. 147, p. 107057, 2021.
- G. Torrente, E. Kaufmann, P. Föhn, and D. Scaramuzza, “Data-driven MPC for quadrotors,” IEEE Robot. Autom. Lett., vol. 6, no. 2, pp. 3769–3776, 2021.
- M. Raissia, P. Perdikarisb, and G. Karniadakis, “Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations,” Journal of Computational physics, vol. 378, pp. 686–707, 2019.
- A. Ghosh, J. Hong, D. Yin, and K. Ramchandran, “Robust federated learning in a heterogeneous environment,” arXiv preprint arXiv:1906.06629, 2019.
- R. Olfati-Saber, “Distributed kalman filtering for sensor networks,” in 2007 46th IEEE Conference on Decision and Control, 2007, pp. 5492–5498.