Nonlinear Kalman Filtering based on Self-Attention Mechanism and Lattice Trajectory Piecewise Linear Approximation (2404.03915v1)
Abstract: The traditional Kalman filter (KF) is widely applied in control systems, but it relies heavily on the accuracy of the system model and noise parameters, so its performance can degrade when either is inaccurate. Introducing neural networks into the KF framework offers a data-driven way to compensate for these inaccuracies, improving the filter's performance while maintaining interpretability. However, existing studies mostly employ recurrent neural networks (RNNs), which fail to fully capture the dependencies among state sequences and lead to an unstable training process. In this paper, we propose a novel Kalman filtering algorithm named the attention Kalman filter (AtKF), which incorporates a self-attention network to capture the dependencies among state sequences. To address the instability of recursive training, a parallel pre-training strategy is devised. Specifically, this strategy piecewise linearizes the system via the lattice trajectory piecewise linear (LTPWL) expression and generates pre-training data through a batch estimation algorithm, which exploits the self-attention mechanism's parallel processing ability. Experimental results on a two-dimensional nonlinear system demonstrate that AtKF outperforms other filters under noise disturbances and model mismatches.
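The lattice piecewise linear form that LTPWL builds on can be illustrated with a minimal sketch. It assumes the generic max-min representation, f(x) = max over index sets of the min over the affine pieces in each set, and is not the paper's LTPWL construction or its trajectory-based choice of linearization points. The function name `lattice_pwl`, the arrays `A` and `b`, and `index_sets` are hypothetical names chosen for this illustration, and the example encodes a simple saturation function rather than the paper's system.

```python
import numpy as np

def lattice_pwl(x, A, b, index_sets):
    """Evaluate a lattice (max-min) piecewise linear function at x.

    A[j] @ x + b[j] is the j-th affine piece; each entry of index_sets
    lists the pieces whose minimum forms one term of the outer maximum.
    (Illustrative helper, not taken from the paper.)
    """
    x = np.asarray(x, dtype=float)
    values = A @ x + b  # value of every affine piece at x
    return max(values[list(s)].min() for s in index_sets)

# Saturation sat(x) = max(min(x, 1), -1) written in lattice form.
# Pieces: 0 -> x, 1 -> constant 1, 2 -> constant -1.
A = np.array([[1.0], [0.0], [0.0]])
b = np.array([0.0, 1.0, -1.0])
index_sets = [(0, 1), (2,)]

for x in (-3.0, 0.5, 3.0):
    print(x, lattice_pwl([x], A, b, index_sets))
```

Every affine piece is evaluated in one matrix product, which is why this form suits parallel evaluation. How the pieces and index sets are chosen for a given nonlinear system is the part specific to LTPWL and is not shown here.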