CPS-LLM: Large Language Model based Safe Usage Plan Generator for Human-in-the-Loop Human-in-the-Plant Cyber-Physical System (2405.11458v1)
Abstract: We explore the usage of LLMs (LLM) in human-in-the-loop human-in-the-plant cyber-physical systems (CPS) to translate a high-level prompt into a personalized plan of actions, and subsequently convert that plan into a grounded inference of sequential decision-making automated by a real-world CPS controller to achieve a control goal. We show that it is relatively straightforward to contextualize an LLM so it can generate domain-specific plans. However, these plans may be infeasible for the physical system to execute or the plan may be unsafe for human users. To address this, we propose CPS-LLM, an LLM retrained using an instruction tuning framework, which ensures that generated plans not only align with the physical system dynamics of the CPS but are also safe for human users. The CPS-LLM consists of two innovative components: a) a liquid time constant neural network-based physical dynamics coefficient estimator that can derive coefficients of dynamical models with some unmeasured state variables; b) the model coefficients are then used to train an LLM with prompts embodied with traces from the dynamical system and the corresponding model coefficients. We show that when the CPS-LLM is integrated with a contextualized chatbot such as BARD it can generate feasible and safe plans to manage external events such as meals for automated insulin delivery systems used by Type 1 Diabetes subjects.
- Statistical Conformance Checking of Aviation Cyber-Physical Systems by Mining Physics Guided Models. In 2023 IEEE Aerospace Conference, 1–8. IEEE.
- Bergman, R. N. 2021. Origins and history of the minimal model of glucose regulation. Frontiers in endocrinology, 11: 583016.
- Language Models are Few-Shot Learners. In Larochelle, H.; Ranzato, M.; Hadsell, R.; Balcan, M.; and Lin, H., eds., Advances in Neural Information Processing Systems, volume 33, 1877–1901. Curran Associates, Inc.
- Butcher, J. C. 1996. A history of Runge-Kutta methods. Applied numerical mathematics, 20(3): 247–260.
- Alpagasus: Training a better alpaca with fewer data. arXiv preprint arXiv:2307.08701.
- PaLM: Scaling Language Modeling with Pathways.
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In North American Chapter of the Association for Computational Linguistics.
- Robust satisfaction of temporal logic over real-valued signals. In International Conference on Formal Modeling and Analysis of Timed Systems, 92–106. Springer.
- Successful at-home use of the tandem control-IQ artificial pancreas system in young children during a randomized controlled trial. Diabetes technology & therapeutics, 21(4): 159–169.
- A comprehensive survey on safe reinforcement learning. Journal of Machine Learning Research, 16(1): 1437–1480.
- GoogleAI. 2023. Bard: A Large Language Model from Google AI. https://ai.googleblog.com/2022/01/lamda-language-model-for-dialogue.html. Accessed December 23, 2023.
- Liquid time-constant networks. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 35, 7657–7666.
- A real-world study of user characteristics, safety and efficacy of open-source closed-loop systems and Medtronic 670G. Diabetes, Obesity and Metabolism, 23(8): 1989–1994.
- Optimal control of mixed logical dynamical systems with linear temporal logic specifications. In 2008 47th IEEE Conference on Decision and Control, 2117–2122. IEEE.
- Temporal-logic-based reactive mission and motion planning. IEEE transactions on robotics, 25(6): 1370–1381.
- Synthesis for human-in-the-loop control systems. In Tools and Algorithms for the Construction and Analysis of Systems: 20th International Conference, TACAS 2014, Held as Part of the European Joint Conferences on Theory and Practice of Software, ETAPS 2014, Grenoble, France, April 5-13, 2014. Proceedings 20, 470–484. Springer.
- Detection of Unknown-Unknowns in Cyber-Physical Systems using Statistical Conformance with Physics Guided Process Models. arXiv preprint arXiv:2309.02603.
- Cyphytest: Cyber physical interaction aware test case generation to identify operational changes. In 2022 IEEE 5th International Conference on Industrial Cyber-Physical Systems (ICPS), 01–06. IEEE.
- The UVA/PADOVA type 1 diabetes simulator: new features. Journal of diabetes science and technology, 8(1): 26–34.
- The lyapunov neural network: Adaptive stability certification for safe learning of dynamical systems. In Conference on Robot Learning, 466–476. PMLR.
- Benefits of a bolus calculator in pre-and postprandial glycaemic control and meal flexibility of paediatric patients using continuous subcutaneous insulin infusion (CSII). Diabetic Medicine, 25(9): 1036–1042.
- Alpaca: A Strong, Replicable Instruction-Following Model. Human Centered Artificial Intelligence. Website: Stanford University, Web page: https://crfm.stanford.edu/2023/03/13/alpaca.html.
- LaMDA: Language Models for Dialog Applications. arXiv:2201.08239.
- Llama 2: Open foundation and fine-tuned chat models. arXiv preprint arXiv:2307.09288.
- A Survey of Large Language Models. arXiv:2303.18223.
- Ayan Banerjee (159 papers)
- Aranyak Maity (6 papers)
- Payal Kamboj (9 papers)
- Sandeep K. S. Gupta (9 papers)