Transfer of Safety Controllers Through Learning Deep Inverse Dynamics Model (2405.13735v2)

Published 22 May 2024 in eess.SY, cs.AI, cs.LG, and cs.SY

Abstract: Control barrier certificates have proven effective in formally guaranteeing the safety of control systems. However, designing a control barrier certificate is a time-consuming and computationally expensive endeavor that requires expert input in the form of domain knowledge and mathematical maturity. Additionally, when a system undergoes slight changes, the new controller and its correctness certificate need to be recomputed, incurring computational challenges similar to those faced during the design of the original controller. Prior approaches have utilized transfer learning to transfer safety guarantees in the form of a barrier certificate while maintaining the control invariant. Unfortunately, in practical settings, the source and the target environments often deviate substantially in their control inputs, rendering the aforementioned approach impractical. To address this challenge, we propose integrating inverse dynamics (a neural network that suggests the required action given a desired successor state) of the target system with the barrier certificate of the source system to provide a formal proof of safety. In addition, we propose a validity condition that, when met, guarantees correctness of the controller. We demonstrate the effectiveness of our approach through three case studies.
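
The idea in the abstract can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper: the one-dimensional source and target dynamics, the source controller, the barrier function B(x) = x^2 - 1, and all function names are hypothetical. The inverse dynamics here is an exact closed-form expression standing in for the learned neural network, and the sketch only checks safety along one simulated trajectory; it does not reproduce the paper's validity condition or its formal proof.

```python
# Toy sketch (all dynamics, names, and constants are hypothetical).
# Source system: x' = x + u.   Target system: x' = x + 2u.
# Safe set: states with barrier(x) <= 0, i.e. |x| <= 1.


def source_step(x, u):
    """Source dynamics (assumed for illustration)."""
    return x + u


def target_step(x, u):
    """Target dynamics: same state space, different input gain (assumed)."""
    return x + 2.0 * u


def source_controller(x):
    """A controller assumed to keep the source system inside the safe set."""
    return -0.5 * x


def barrier(x):
    """Assumed barrier function of the source system; safe when <= 0."""
    return x * x - 1.0


def inverse_dynamics(x, x_next):
    """Action that moves the target system from x to x_next.

    Exact closed form used as a stand-in for a learned neural network.
    """
    return (x_next - x) / 2.0


def transferred_controller(x):
    """Ask the source controller for a successor state, then let the
    target's inverse dynamics choose the action that reaches it."""
    desired_next = source_step(x, source_controller(x))
    return inverse_dynamics(x, desired_next)


def simulate(x0, steps=20):
    """Roll out the target system under the transferred controller."""
    xs = [x0]
    for _ in range(steps):
        x = xs[-1]
        xs.append(target_step(x, transferred_controller(x)))
    return xs


trajectory = simulate(0.9)
is_safe = all(barrier(x) <= 0.0 for x in trajectory)
print(is_safe)
```

Because the stand-in inverse dynamics is exact, the target trajectory coincides with the one the source system would follow, so the source barrier function stays non-positive along it. With a learned, approximate inverse dynamics model this no longer holds automatically, which is presumably why the abstract introduces a separate validity condition.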
