Enhancing Trust in Autonomous Agents: An Architecture for Accountability and Explainability through Blockchain and Large Language Models (2403.09567v3)
Abstract: The deployment of autonomous agents in environments involving human interaction has increasingly raised security concerns. Consequently, understanding the circumstances behind an event becomes critical, requiring the development of capabilities to justify their behaviors to non-expert users. Such explanations are essential in enhancing trustworthiness and safety, acting as a preventive measure against failures, errors, and misunderstandings. Additionally, they contribute to improving communication, bridging the gap between the agent and the user, thereby improving the effectiveness of their interactions. This work presents an accountability and explainability architecture implemented for ROS-based mobile robots. The proposed solution consists of two main components. Firstly, a black box-like element to provide accountability, featuring anti-tampering properties achieved through blockchain technology. Secondly, a component in charge of generating natural language explanations by harnessing the capabilities of LLMs over the data contained within the previously mentioned black box. The study evaluates the performance of our solution in three different scenarios, each involving autonomous agent navigation functionalities. This evaluation includes a thorough examination of accountability and explainability metrics, demonstrating the effectiveness of our approach in using accountable data from robot actions to obtain coherent, accurate and understandable explanations, even when facing challenges inherent in the use of autonomous agents in real-world scenarios.
- Explainable Goal-driven Agents and Robots - A Comprehensive Review, ACM Computing Surveys 55 (2023) 1–41.
- Towards Providing Explanations for AI Planner Decisions, CoRR abs/1810.0 (2018).
- A. Rosenfeld, A. Richardson, Explainability in human–agent systems, Autonomous Agents and Multi-Agent Systems 33 (2019) 673–705.
- Explainable robotic systems: understanding goal-driven actions in a reinforcement learning scenario, Neural Computing and Applications 35 (2023) 18113–18130.
- Y. Al-Slais, M. Ali, Robotic Process Automation and Intelligent Automation Security Challenges: A Review, in: 2023 International Conference On Cyber Management And Engineering (CyMaEn), IEEE, 2023, pp. 71–77. URL: https://ieeexplore.ieee.org/document/10050996/. doi:10.1109/CyMaEn57228.2023.10050996.
- Secure and transparent audit logs with BlockAudit, Journal of Network and Computer Applications 145 (2019) 102406.
- Blockchain for Embedded System Accountability, in: 2021 IEEE International Conference on Blockchain and Cryptocurrency (ICBC), IEEE, 2021, pp. 1–5. URL: https://ieeexplore.ieee.org/document/9461143/. doi:10.1109/ICBC51069.2021.9461143.
- Analysis of the Performance of Different Accountability Strategies for Autonomous Robots, in: J. J. Gude Prego, J. G. de la Puerta, P. García Bringas, H. Quintián, E. Corchado (Eds.), 14th International Conference on Computational Intelligence in Security for Information Systems and 12th International Conference on European Transnational Educational (CISIS 2021 and ICEUTE 2021), Springer International Publishing, Cham, 2022, pp. 41–51. URL: https://link.springer.com/10.1007/978-3-030-87872-6_5. doi:10.1007/978-3-030-87872-6_5.
- A secure and auditable logging infrastructure based on a permissioned blockchain, Computers & Security 87 (2019) 101602.
- A blockchain integration to support failures prediction from log files in multi-agent systems technology, Expert Systems with Applications 240 (2024) 122122.
- M. Iqbal, R. Matulevičius, Comparison of Blockchain-Based Solutions to Mitigate Data Tampering Security Risk, in: C. Di Ciccio, R. Gabryelczyk, L. García-Bañuelos, T. Hernaus, R. Hull, M. Indihar Štemberger, A. K Ho, M. Staples (Eds.), Business Process Management: Blockchain and Central and Eastern Europe Forum, Springer International Publishing, Cham, 2019, pp. 13–28. URL: http://link.springer.com/10.1007/978-3-030-30429-4_2. doi:10.1007/978-3-030-30429-4_2.
- Y. Shehu, R. Harper, Enhancements to Language Modeling Techniques for Adaptable Log Message Classification, IEEE Transactions on Network and Service Management 19 (2022) 4662–4675.
- Robust and Transferable Anomaly Detection in Log Data using Pre-Trained Language Models, in: 2021 IEEE/ACM International Workshop on Cloud Intelligence (CloudIntelligence), IEEE, 2021, pp. 19–24. URL: https://ieeexplore.ieee.org/document/9527018/. doi:10.1109/CloudIntelligence52565.2021.00013.
- Explainable agents and robots: Results from a systematic literature review, Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS 2 (2019) 1078–1088.
- Explainable Agency for Intelligent Autonomous Systems, Proceedings of the AAAI Conference on Artificial Intelligence 31 (2017) 4762–4763.
- T. Kocmi, C. Federmann, Large Language Models Are State-of-the-Art Evaluators of Translation Quality, arXiv preprint arXiv:2302.14520 (2023).
- Benchmarking Large Language Models for News Summarization, Transactions of the Association for Computational Linguistics 12 (2024) 39–57.
- Automatic Generation of Programming Exercises and Code Explanations Using Large Language Models, in: Proceedings of the 2022 ACM Conference on International Computing Education Research - Volume 1, ICER ’22, ACM, New York, NY, USA, 2022, pp. 27–43. URL: https://doi.org/10.1145/3501385.3543957https://dl.acm.org/doi/10.1145/3501385.3543957. doi:10.1145/3501385.3543957.
- Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation, arXiv preprint arXiv:2307.11019 (2023).
- A comprehensive survey on blockchain technology, Sustainable Energy Technologies and Assessments 52 (2022) 102039.
- H. Guo, X. Yu, A survey on blockchain technology and its security, Blockchain: Research and Applications 3 (2022) 100067.
- Permissioned vs. Permissionless Blockchain: How and Why There Is Only One Right Choice, Journal of Software 16 (2021) 95–106.
- Proof-of-Stake Consensus Mechanisms for Future Blockchain Networks: Fundamentals, Applications and Opportunities, IEEE Access 7 (2019) 85727–85745.
- A. I. Sanka, R. C. Cheung, A systematic review of blockchain scalability: Issues, solutions, analysis and future research, Journal of Network and Computer Applications 195 (2021) 103232.
- Blockchain-based trusted accountability in the maintenance of medical imaging equipment, Expert Systems with Applications 241 (2024) 122718.
- An Overview of Smart Contract and Use Cases in Blockchain Technology, in: 2018 9th International Conference on Computing, Communication and Networking Technologies (ICCCNT), IEEE, 2018, pp. 1–4. URL: https://ieeexplore.ieee.org/document/8494045/. doi:10.1109/ICCCNT.2018.8494045.
- BlockIPFS - Blockchain-Enabled Interplanetary File System for Forensic and Trusted Data Traceability, in: 2019 IEEE International Conference on Blockchain (Blockchain), IEEE, 2019, pp. 18–25. URL: https://ieeexplore.ieee.org/document/8946164/. doi:10.1109/Blockchain.2019.00012.
- L. Shekhtman, E. Waisbard, EngraveChain: Tamper-proof distributed log system, in: Proceedings of the 2nd Workshop on Blockchain-enabled Networked Sensor, BlockSys’19, ACM, New York, NY, USA, 2019, pp. 8–14. URL: https://doi.org/10.1145/3362744.3363346https://dl.acm.org/doi/10.1145/3362744.3363346. doi:10.1145/3362744.3363346.
- A Blockchain-Facilitated Secure Sensing Data Processing and Logging System, IEEE Access 11 (2023) 21712–21728.
- Immutable Log Storage as a Service on Private and Public Blockchains, IEEE Transactions on Services Computing 16 (2023) 356–369.
- Permission less Block chains and Secure Logging, in: 2019 IEEE International Conference on Blockchain and Cryptocurrency (ICBC), IEEE, 2019, pp. 56–60. URL: https://ieeexplore.ieee.org/document/8751306/. doi:10.1109/BLOC.2019.8751306.
- Robot Accident Investigation: A Case Study in Responsible Robotics, in: A. Cavalcanti, B. Dongol, R. Hierons, J. Timmis, J. Woodcock (Eds.), Software Engineering for Robotics, Springer International Publishing, Cham, 2021, pp. 165–187. URL: https://doi.org/10.1007/978-3-030-66494-7_6https://link.springer.com/10.1007/978-3-030-66494-7_6. doi:10.1007/978-3-030-66494-7_6.
- Secure Data Recording and Bio-Inspired Functional Integrity for Intelligent Robots, in: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), IEEE, 2018, pp. 8723--8728. URL: https://ieeexplore.ieee.org/document/8593994/. doi:10.1109/IROS.2018.8593994.
- Analysis of the Performance of Different Accountability Strategies for Autonomous Robots, in: J. J. Gude Prego, J. G. de la Puerta, P. García Bringas, H. Quintián, E. Corchado (Eds.), 14th International Conference on Computational Intelligence in Security for Information Systems and 12th International Conference on European Transnational Educational (CISIS 2021 and ICEUTE 2021), Springer International Publishing, Cham, 2022, pp. 41--51. URL: https://link.springer.com/10.1007/978-3-030-87872-6_5. doi:10.1007/978-3-030-87872-6_5.
- A Modular and Portable Black Box Recorder for Increased Transparency of Autonomous Service Robots, IEEE Robotics and Automation Letters 7 (2022) 10673--10680.
- A Survey on Blockchain in Robotics: Issues, Opportunities, Challenges and Future Directions, Journal of Network and Computer Applications 196 (2021) 103245.
- Blockchain for AI: Review and Open Research Challenges, IEEE Access 7 (2019) 10127--10149.
- Black Block Recorder: Immutable Black Box Logging for Robots via Blockchain, IEEE Robotics and Automation Letters 4 (2019) 3812--3819.
- Blockchain Technology Secures Robot Swarms: A Comparison of Consensus Protocols and Their Resilience to Byzantine Robots, Frontiers in Robotics and AI 7 (2020) 54.
- A Time-Segmented Consortium Blockchain for Robotic Event Registration, in: 2021 The 3rd International Conference on Blockchain Technology, ICBCT ’21, ACM, New York, NY, USA, 2021, pp. 117--122. URL: https://doi.org/10.1145/3460537.3460557https://dl.acm.org/doi/10.1145/3460537.3460557. doi:10.1145/3460537.3460557.
- ROS-Ethereum: A Convenient Tool to Bridge ROS and Blockchain (Ethereum), Security and Communication Networks 2022 (2022) 1--14.
- Towards A Robot Explanation System: A Survey and Our Approach to State Summarization, Storage and Querying, and Human Interface (2019).
- Trends and Trajectories for Explainable, Accountable and Intelligible Systems, in: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, ACM, New York, NY, USA, 2018, pp. 1--18. URL: https://doi.org/10.1145/3173574.3174156https://dl.acm.org/doi/10.1145/3173574.3174156. doi:10.1145/3173574.3174156.
- Cases for Explainable Software Systems: Characteristics and Examples, Proceedings of the IEEE International Conference on Requirements Engineering 2021-September (2021) 181--187.
- T. Sakai, T. Nagai, Explainable autonomous robots: a survey and perspective, Advanced Robotics 36 (2022) 219--238.
- Self-Explaining Social Robots: An Explainable Behavior Generation Architecture for Human-Robot Interaction, Frontiers in Artificial Intelligence 5 (2022) 87.
- Building the Foundation of Robot Explanation Generation Using Behavior Trees, ACM Transactions on Human-Robot Interaction 10 (2021) 1--31.
- Attention is all you need, in: Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS’17, Curran Associates Inc., Red Hook, NY, USA, 2017, p. 6000–6010.
- A Survey of Large Language Models, arXiv preprint arXiv:2307.03109 (2023).
- K. Andriopoulos, J. Pouwelse, Augmenting LLMs with Knowledge: A survey on hallucination prevention, arXiv preprint arXiv:2309.16459 (2023).
- Retrieving Multimodal Information for Augmented Generation: A Survey, arXiv preprint arXiv:2303.10868 (2023).
- Emergent Abilities of Large Language Models, arXiv preprint arXiv:2301.00234 (2022).
- A Survey on In-context Learning, arXiv preprint arXiv:2301.00234 (2022).
- A Survey on Evaluation of Large Language Models, ACM Transactions on Intelligent Systems and Technology (2024).
- Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena, arXiv preprint arXiv:2306.05685 (2023).
- GPT-2C: A Parser for Honeypot Logs Using Large Pre-Trained Language Models, in: Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM ’21, Association for Computing Machinery, New York, NY, USA, 2022, pp. 649--653. URL: https://doi.org/10.1145/3487351.3492723. doi:10.1145/3487351.3492723.
- Using Large Language Models for Interpreting Autonomous Robots Behaviors, arXiv preprint arXiv:2304.14844 (2023).
- Exploring the performance of ROS2, in: Proceedings of the 13th International Conference on Embedded Software, EMSOFT ’16, ACM, New York, NY, USA, 2016, pp. 1--10. URL: https://doi.org/10.1145/2968478.2968502https://dl.acm.org/doi/10.1145/2968478.2968502. doi:10.1145/2968478.2968502.
- The Marathon 2: A Navigation System, in: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), IEEE, 2020, pp. 2718--2725. URL: https://ieeexplore.ieee.org/document/9341207/. doi:10.1109/IROS45743.2020.9341207.
- Building the Foundation of Robot Explanation Generation Using Behavior Trees, J. Hum.-Robot Interact. 10 (2021).
- Accountability and Explainability in Robotics: A Proof of Concept for ROS 2- And Nav2-Based Mobile Robots, in: P. García Bringas, H. Pérez García, F. J. de Pisón, F. Martínez Álvarez, A. Troncoso Lora, Á. Herrero, J. L. Calvo Rolle, H. Quintián, E. Corchado (Eds.), International Joint Conference 16th International Conference on Computational Intelligence in Security for Information Systems (CISIS 2023) 14th International Conference on EUropean Transnational Education (ICEUTE 2023), Springer Nature Switzerland, Cham, 2023, pp. 3--13. URL: https://link.springer.com/10.1007/978-3-031-42519-6_1. doi:10.1007/978-3-031-42519-6_1.
- Llama 2: Open Foundation and Fine-Tuned Chat Models, arXiv preprint arXiv:2307.09288 (2023).
- Mistral 7B, arXiv preprint arXiv:2310.06825 (2023).
- Zephyr: Direct Distillation of LM Alignment, arXiv preprint arXiv:2310.16944 (2023).
- One Embedder, Any Task: Instruction-Finetuned Text Embeddings, arXiv preprint arXiv:2212.09741 (2022).
- B. Lin, Reinforcement learning and bandits for speech and language processing: Tutorial, review and outlook, Expert Systems with Applications 238 (2024) 122254.
- Sparks of Artificial General Intelligence: Early experiments with GPT-4, arXiv preprint arXiv:2303.12712 (2023).
- D. Gunning, D. W. Aha, DARPA’s Explainable Artificial Intelligence Program, AI Magazine 40 (2019) 44--58.
- Dissociating language and thought in large language models, arXiv preprint arXiv:2301.06627 (2023).