Model order reduction of deep structured state-space models: A system-theoretic approach (2403.14833v1)
Abstract: With a specific emphasis on control design objectives, achieving accurate system modeling with limited complexity is crucial in parametric system identification. The recently introduced deep structured state-space models (SSM), which feature linear dynamical blocks as key constituent components, offer high predictive performance. However, the learned representations often suffer from excessively large model orders, which render them unsuitable for control design purposes. The current paper addresses this challenge by means of system-theoretic model order reduction techniques that target the linear dynamical blocks of SSMs. We introduce two regularization terms which can be incorporated into the training loss for improved model order reduction. In particular, we consider modal $\ell_1$ and Hankel nuclear norm regularization to promote sparsity, allowing one to retain only the relevant states without sacrificing accuracy. The presented regularizers lead to advantages in terms of parsimonious representations and faster inference resulting from the reduced order models. The effectiveness of the proposed methodology is demonstrated using real-world ground vibration data from an aircraft.
- Model reduction via balanced realizations: an extension and frequency weighting techniques. IEEE Transactions on Automatic Control, 33(7):687–692, 1988.
- A. C. Antoulas. Approximation of Large-Scale Dynamical Systems. Society for Industrial and Applied Mathematics, 2005.
- A. Bemporad. Linear and nonlinear system identification under ℓ1subscriptℓ1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT- and group-Lasso regularization via L-BFGS-B. arXiv preprint arXiv:2403.03827, 2024.
- G. E. Blelloch. Prefix sums and their applications. 1990.
- Language modeling with gated convolutional networks. In International Conference on Machine Learning, pages 933–941. PMLR, 2017.
- A rank minimization heuristic with application to minimum order system approximation. In Proc. of the American Control Conf., volume 6, pages 4734–4739, 2001.
- Learning neural state-space models: do we need a state estimator? arXiv preprint arXiv:2206.12928, 2022.
- M. Forgione and D. Piga. dynoNet: A neural network architecture for learning dynamical systems. International Journal of Adaptive Control and Signal Processing, 35(4):612–626, 2021.
- K. Glover. All optimal hankel-norm approximations of linear multivariable systems and their L∞superscript𝐿L^{\infty}italic_L start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT-error bounds. International Journal of Control, 39(6):1115–1193, 1984.
- M. Green and D. Limebeer. Linear Robust Control. Dover publications, 2012.
- On the parameterization and initialization of diagonal state space models. Advances in Neural Information Processing Systems, 35:35971–35983, 2022.
- Efficiently modeling long sequences with structured state spaces. The International Conference on Learning Representations (ICLR), 2022.
- T. Katayama. Subspace Methods for System Identification. Springer London, 2005.
- Y. Liu and B. O. D. Anderson. Singular perturbation approximation of balanced systems. International Journal of Control, 50(4):1379–1405, 1989.
- B. Moore. Principal component analysis in linear systems: Controllability, observability, and model reduction. IEEE Transactions on Automatic Control, 26(1):17–32, 1981.
- J. P. Noël and M. Schoukens. F-16 aircraft benchmark based on ground vibration test data. In 2017 Workshop on Nonlinear System Identification Benchmarks, pages 19–23, 2017.
- Resurrecting recurrent neural networks for long sequences. arXiv preprint arXiv:2303.06349, 2023.
- Regularized linear system identification using atomic, nuclear and kernel-based norms: The role of the stability constraint. Automatica, 69:137–149, 2016.
- Language models are unsupervised multitask learners. OpenAI blog, 1(8):9, 2019.
- Recurrent equilibrium networks: Flexible dynamic models with guaranteed stability and robustness. IEEE Transactions on Automatic Control, 2023.
- G. Scarciotti and A. Astolfi. Interconnection-based model order reduction - a survey. European Journal of Control, 75:100929, 2024.
- J. Schoukens and L. Ljung. Nonlinear system identification: A user-oriented road map. IEEE Control Systems Magazine, 39(6):28–99, 2019.
- M. Schoukens and K. Tiels. Identification of block-oriented nonlinear systems starting from linear approximations: A survey. Automatica, 85:272–292, 2017.
- Simplified state space layers for sequence modeling. arXiv preprint arXiv:2208.04933, 2022.
- Long range arena: A benchmark for efficient transformers. arXiv preprint arXiv:2011.04006, 2020.
- R. Tibshirani. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B, 58(1):267–288, 1996.
- Y. Wu and K. He. Group normalization. In Proceedings of the European conference on computer vision (ECCV), pages 3–19, 2018.