Promises of Deep Kernel Learning for Control Synthesis (2309.06569v2)

Published 12 Sep 2023 in eess.SY, cs.LG, and cs.SY

Abstract: Deep Kernel Learning (DKL) combines the representational power of neural networks with the uncertainty quantification of Gaussian Processes. Hence, it is potentially a promising tool to learn and control complex dynamical systems. In this work, we develop a scalable abstraction-based framework that enables the use of DKL for control synthesis of stochastic dynamical systems against complex specifications. Specifically, we consider temporal logic specifications and create an end-to-end framework that uses DKL to learn an unknown system from data and formally abstracts the DKL model into an Interval Markov Decision Process (IMDP) to perform control synthesis with correctness guarantees. Furthermore, we identify a deep architecture that enables accurate learning and efficient abstraction computation. The effectiveness of our approach is illustrated on various benchmarks, including a 5-D nonlinear stochastic system, showing how control synthesis with DKL can substantially outperform state-of-the-art competitive methods.
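
The synthesis step mentioned in the abstract can be illustrated with a minimal sketch of robust value iteration on an IMDP. This is not the paper's implementation: the function names, the three-state model, and all interval values below are hypothetical and chosen only for illustration. The sketch assumes a reachability objective and computes a lower bound on the best achievable probability of reaching a target state when an adversary picks the transition probabilities inside the given intervals.

```python
def worst_case_expectation(lower, upper, values):
    """Minimise sum_j p[j] * values[j] over all distributions p with
    lower[j] <= p[j] <= upper[j] and sum(p) == 1."""
    if sum(lower) > 1.0 + 1e-12 or sum(upper) < 1.0 - 1e-12:
        raise ValueError("intervals admit no probability distribution")
    p = list(lower)
    budget = 1.0 - sum(lower)
    # Give the remaining probability mass to the lowest-value successors first.
    for j in sorted(range(len(values)), key=lambda j: values[j]):
        extra = min(upper[j] - lower[j], budget)
        p[j] += extra
        budget -= extra
    return sum(pj * vj for pj, vj in zip(p, values))


def robust_reach_probability(intervals, target, n_states, sweeps=100):
    """Lower bound on the maximal probability of reaching `target`.

    intervals[s][a] is a pair (lower, upper) of lists indexed by successor state.
    """
    values = [1.0 if s in target else 0.0 for s in range(n_states)]
    for _ in range(sweeps):
        updated = []
        for s in range(n_states):
            if s in target:
                updated.append(1.0)
            else:
                # The controller maximises; the adversary minimises within the intervals.
                updated.append(max(
                    worst_case_expectation(lo, up, values)
                    for lo, up in intervals[s].values()
                ))
        values = updated
    return values


# Hypothetical 3-state IMDP: state 0 is the start, 1 the target, 2 an unsafe sink.
example_intervals = {
    0: {
        "a": ([0.0, 0.6, 0.1], [0.0, 0.9, 0.4]),
        "b": ([0.5, 0.2, 0.0], [0.7, 0.5, 0.1]),
    },
    1: {"stay": ([0.0, 1.0, 0.0], [0.0, 1.0, 0.0])},
    2: {"stay": ([0.0, 0.0, 1.0], [0.0, 0.0, 1.0])},
}
lower_bounds = robust_reach_probability(example_intervals, target={1}, n_states=3)
print(lower_bounds)
```

In a pipeline like the one the abstract describes, the interval bounds would come from the learned model and its uncertainty rather than being written by hand as they are here.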
