Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
125 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Enhancing Trust in Autonomous Agents: An Architecture for Accountability and Explainability through Blockchain and Large Language Models (2403.09567v3)

Published 14 Mar 2024 in cs.RO and cs.AI

Abstract: The deployment of autonomous agents in environments involving human interaction has increasingly raised security concerns. Consequently, understanding the circumstances behind an event becomes critical, requiring the development of capabilities to justify their behaviors to non-expert users. Such explanations are essential in enhancing trustworthiness and safety, acting as a preventive measure against failures, errors, and misunderstandings. Additionally, they contribute to improving communication, bridging the gap between the agent and the user, thereby improving the effectiveness of their interactions. This work presents an accountability and explainability architecture implemented for ROS-based mobile robots. The proposed solution consists of two main components. Firstly, a black box-like element to provide accountability, featuring anti-tampering properties achieved through blockchain technology. Secondly, a component in charge of generating natural language explanations by harnessing the capabilities of LLMs over the data contained within the previously mentioned black box. The study evaluates the performance of our solution in three different scenarios, each involving autonomous agent navigation functionalities. This evaluation includes a thorough examination of accountability and explainability metrics, demonstrating the effectiveness of our approach in using accountable data from robot actions to obtain coherent, accurate and understandable explanations, even when facing challenges inherent in the use of autonomous agents in real-world scenarios.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (69)
  1. Explainable Goal-driven Agents and Robots - A Comprehensive Review, ACM Computing Surveys 55 (2023) 1–41.
  2. Towards Providing Explanations for AI Planner Decisions, CoRR abs/1810.0 (2018).
  3. A. Rosenfeld, A. Richardson, Explainability in human–agent systems, Autonomous Agents and Multi-Agent Systems 33 (2019) 673–705.
  4. Explainable robotic systems: understanding goal-driven actions in a reinforcement learning scenario, Neural Computing and Applications 35 (2023) 18113–18130.
  5. Y. Al-Slais, M. Ali, Robotic Process Automation and Intelligent Automation Security Challenges: A Review, in: 2023 International Conference On Cyber Management And Engineering (CyMaEn), IEEE, 2023, pp. 71–77. URL: https://ieeexplore.ieee.org/document/10050996/. doi:10.1109/CyMaEn57228.2023.10050996.
  6. Secure and transparent audit logs with BlockAudit, Journal of Network and Computer Applications 145 (2019) 102406.
  7. Blockchain for Embedded System Accountability, in: 2021 IEEE International Conference on Blockchain and Cryptocurrency (ICBC), IEEE, 2021, pp. 1–5. URL: https://ieeexplore.ieee.org/document/9461143/. doi:10.1109/ICBC51069.2021.9461143.
  8. Analysis of the Performance of Different Accountability Strategies for Autonomous Robots, in: J. J. Gude Prego, J. G. de la Puerta, P. García Bringas, H. Quintián, E. Corchado (Eds.), 14th International Conference on Computational Intelligence in Security for Information Systems and 12th International Conference on European Transnational Educational (CISIS 2021 and ICEUTE 2021), Springer International Publishing, Cham, 2022, pp. 41–51. URL: https://link.springer.com/10.1007/978-3-030-87872-6_5. doi:10.1007/978-3-030-87872-6_5.
  9. A secure and auditable logging infrastructure based on a permissioned blockchain, Computers & Security 87 (2019) 101602.
  10. A blockchain integration to support failures prediction from log files in multi-agent systems technology, Expert Systems with Applications 240 (2024) 122122.
  11. M. Iqbal, R. Matulevičius, Comparison of Blockchain-Based Solutions to Mitigate Data Tampering Security Risk, in: C. Di Ciccio, R. Gabryelczyk, L. García-Bañuelos, T. Hernaus, R. Hull, M. Indihar Štemberger, A. K Ho, M. Staples (Eds.), Business Process Management: Blockchain and Central and Eastern Europe Forum, Springer International Publishing, Cham, 2019, pp. 13–28. URL: http://link.springer.com/10.1007/978-3-030-30429-4_2. doi:10.1007/978-3-030-30429-4_2.
  12. Y. Shehu, R. Harper, Enhancements to Language Modeling Techniques for Adaptable Log Message Classification, IEEE Transactions on Network and Service Management 19 (2022) 4662–4675.
  13. Robust and Transferable Anomaly Detection in Log Data using Pre-Trained Language Models, in: 2021 IEEE/ACM International Workshop on Cloud Intelligence (CloudIntelligence), IEEE, 2021, pp. 19–24. URL: https://ieeexplore.ieee.org/document/9527018/. doi:10.1109/CloudIntelligence52565.2021.00013.
  14. Explainable agents and robots: Results from a systematic literature review, Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS 2 (2019) 1078–1088.
  15. Explainable Agency for Intelligent Autonomous Systems, Proceedings of the AAAI Conference on Artificial Intelligence 31 (2017) 4762–4763.
  16. T. Kocmi, C. Federmann, Large Language Models Are State-of-the-Art Evaluators of Translation Quality, arXiv preprint arXiv:2302.14520 (2023).
  17. Benchmarking Large Language Models for News Summarization, Transactions of the Association for Computational Linguistics 12 (2024) 39–57.
  18. Automatic Generation of Programming Exercises and Code Explanations Using Large Language Models, in: Proceedings of the 2022 ACM Conference on International Computing Education Research - Volume 1, ICER ’22, ACM, New York, NY, USA, 2022, pp. 27–43. URL: https://doi.org/10.1145/3501385.3543957https://dl.acm.org/doi/10.1145/3501385.3543957. doi:10.1145/3501385.3543957.
  19. Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation, arXiv preprint arXiv:2307.11019 (2023).
  20. A comprehensive survey on blockchain technology, Sustainable Energy Technologies and Assessments 52 (2022) 102039.
  21. H. Guo, X. Yu, A survey on blockchain technology and its security, Blockchain: Research and Applications 3 (2022) 100067.
  22. Permissioned vs. Permissionless Blockchain: How and Why There Is Only One Right Choice, Journal of Software 16 (2021) 95–106.
  23. Proof-of-Stake Consensus Mechanisms for Future Blockchain Networks: Fundamentals, Applications and Opportunities, IEEE Access 7 (2019) 85727–85745.
  24. A. I. Sanka, R. C. Cheung, A systematic review of blockchain scalability: Issues, solutions, analysis and future research, Journal of Network and Computer Applications 195 (2021) 103232.
  25. Blockchain-based trusted accountability in the maintenance of medical imaging equipment, Expert Systems with Applications 241 (2024) 122718.
  26. An Overview of Smart Contract and Use Cases in Blockchain Technology, in: 2018 9th International Conference on Computing, Communication and Networking Technologies (ICCCNT), IEEE, 2018, pp. 1–4. URL: https://ieeexplore.ieee.org/document/8494045/. doi:10.1109/ICCCNT.2018.8494045.
  27. BlockIPFS - Blockchain-Enabled Interplanetary File System for Forensic and Trusted Data Traceability, in: 2019 IEEE International Conference on Blockchain (Blockchain), IEEE, 2019, pp. 18–25. URL: https://ieeexplore.ieee.org/document/8946164/. doi:10.1109/Blockchain.2019.00012.
  28. L. Shekhtman, E. Waisbard, EngraveChain: Tamper-proof distributed log system, in: Proceedings of the 2nd Workshop on Blockchain-enabled Networked Sensor, BlockSys’19, ACM, New York, NY, USA, 2019, pp. 8–14. URL: https://doi.org/10.1145/3362744.3363346https://dl.acm.org/doi/10.1145/3362744.3363346. doi:10.1145/3362744.3363346.
  29. A Blockchain-Facilitated Secure Sensing Data Processing and Logging System, IEEE Access 11 (2023) 21712–21728.
  30. Immutable Log Storage as a Service on Private and Public Blockchains, IEEE Transactions on Services Computing 16 (2023) 356–369.
  31. Permission less Block chains and Secure Logging, in: 2019 IEEE International Conference on Blockchain and Cryptocurrency (ICBC), IEEE, 2019, pp. 56–60. URL: https://ieeexplore.ieee.org/document/8751306/. doi:10.1109/BLOC.2019.8751306.
  32. Robot Accident Investigation: A Case Study in Responsible Robotics, in: A. Cavalcanti, B. Dongol, R. Hierons, J. Timmis, J. Woodcock (Eds.), Software Engineering for Robotics, Springer International Publishing, Cham, 2021, pp. 165–187. URL: https://doi.org/10.1007/978-3-030-66494-7_6https://link.springer.com/10.1007/978-3-030-66494-7_6. doi:10.1007/978-3-030-66494-7_6.
  33. Secure Data Recording and Bio-Inspired Functional Integrity for Intelligent Robots, in: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), IEEE, 2018, pp. 8723--8728. URL: https://ieeexplore.ieee.org/document/8593994/. doi:10.1109/IROS.2018.8593994.
  34. Analysis of the Performance of Different Accountability Strategies for Autonomous Robots, in: J. J. Gude Prego, J. G. de la Puerta, P. García Bringas, H. Quintián, E. Corchado (Eds.), 14th International Conference on Computational Intelligence in Security for Information Systems and 12th International Conference on European Transnational Educational (CISIS 2021 and ICEUTE 2021), Springer International Publishing, Cham, 2022, pp. 41--51. URL: https://link.springer.com/10.1007/978-3-030-87872-6_5. doi:10.1007/978-3-030-87872-6_5.
  35. A Modular and Portable Black Box Recorder for Increased Transparency of Autonomous Service Robots, IEEE Robotics and Automation Letters 7 (2022) 10673--10680.
  36. A Survey on Blockchain in Robotics: Issues, Opportunities, Challenges and Future Directions, Journal of Network and Computer Applications 196 (2021) 103245.
  37. Blockchain for AI: Review and Open Research Challenges, IEEE Access 7 (2019) 10127--10149.
  38. Black Block Recorder: Immutable Black Box Logging for Robots via Blockchain, IEEE Robotics and Automation Letters 4 (2019) 3812--3819.
  39. Blockchain Technology Secures Robot Swarms: A Comparison of Consensus Protocols and Their Resilience to Byzantine Robots, Frontiers in Robotics and AI 7 (2020) 54.
  40. A Time-Segmented Consortium Blockchain for Robotic Event Registration, in: 2021 The 3rd International Conference on Blockchain Technology, ICBCT ’21, ACM, New York, NY, USA, 2021, pp. 117--122. URL: https://doi.org/10.1145/3460537.3460557https://dl.acm.org/doi/10.1145/3460537.3460557. doi:10.1145/3460537.3460557.
  41. ROS-Ethereum: A Convenient Tool to Bridge ROS and Blockchain (Ethereum), Security and Communication Networks 2022 (2022) 1--14.
  42. Towards A Robot Explanation System: A Survey and Our Approach to State Summarization, Storage and Querying, and Human Interface (2019).
  43. Trends and Trajectories for Explainable, Accountable and Intelligible Systems, in: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, ACM, New York, NY, USA, 2018, pp. 1--18. URL: https://doi.org/10.1145/3173574.3174156https://dl.acm.org/doi/10.1145/3173574.3174156. doi:10.1145/3173574.3174156.
  44. Cases for Explainable Software Systems: Characteristics and Examples, Proceedings of the IEEE International Conference on Requirements Engineering 2021-September (2021) 181--187.
  45. T. Sakai, T. Nagai, Explainable autonomous robots: a survey and perspective, Advanced Robotics 36 (2022) 219--238.
  46. Self-Explaining Social Robots: An Explainable Behavior Generation Architecture for Human-Robot Interaction, Frontiers in Artificial Intelligence 5 (2022) 87.
  47. Building the Foundation of Robot Explanation Generation Using Behavior Trees, ACM Transactions on Human-Robot Interaction 10 (2021) 1--31.
  48. Attention is all you need, in: Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS’17, Curran Associates Inc., Red Hook, NY, USA, 2017, p. 6000–6010.
  49. A Survey of Large Language Models, arXiv preprint arXiv:2307.03109 (2023).
  50. K. Andriopoulos, J. Pouwelse, Augmenting LLMs with Knowledge: A survey on hallucination prevention, arXiv preprint arXiv:2309.16459 (2023).
  51. Retrieving Multimodal Information for Augmented Generation: A Survey, arXiv preprint arXiv:2303.10868 (2023).
  52. Emergent Abilities of Large Language Models, arXiv preprint arXiv:2301.00234 (2022).
  53. A Survey on In-context Learning, arXiv preprint arXiv:2301.00234 (2022).
  54. A Survey on Evaluation of Large Language Models, ACM Transactions on Intelligent Systems and Technology (2024).
  55. Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena, arXiv preprint arXiv:2306.05685 (2023).
  56. GPT-2C: A Parser for Honeypot Logs Using Large Pre-Trained Language Models, in: Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM ’21, Association for Computing Machinery, New York, NY, USA, 2022, pp. 649--653. URL: https://doi.org/10.1145/3487351.3492723. doi:10.1145/3487351.3492723.
  57. Using Large Language Models for Interpreting Autonomous Robots Behaviors, arXiv preprint arXiv:2304.14844 (2023).
  58. Exploring the performance of ROS2, in: Proceedings of the 13th International Conference on Embedded Software, EMSOFT ’16, ACM, New York, NY, USA, 2016, pp. 1--10. URL: https://doi.org/10.1145/2968478.2968502https://dl.acm.org/doi/10.1145/2968478.2968502. doi:10.1145/2968478.2968502.
  59. The Marathon 2: A Navigation System, in: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), IEEE, 2020, pp. 2718--2725. URL: https://ieeexplore.ieee.org/document/9341207/. doi:10.1109/IROS45743.2020.9341207.
  60. Building the Foundation of Robot Explanation Generation Using Behavior Trees, J. Hum.-Robot Interact. 10 (2021).
  61. Accountability and Explainability in Robotics: A Proof of Concept for ROS 2- And Nav2-Based Mobile Robots, in: P. García Bringas, H. Pérez García, F. J. de Pisón, F. Martínez Álvarez, A. Troncoso Lora, Á. Herrero, J. L. Calvo Rolle, H. Quintián, E. Corchado (Eds.), International Joint Conference 16th International Conference on Computational Intelligence in Security for Information Systems (CISIS 2023) 14th International Conference on EUropean Transnational Education (ICEUTE 2023), Springer Nature Switzerland, Cham, 2023, pp. 3--13. URL: https://link.springer.com/10.1007/978-3-031-42519-6_1. doi:10.1007/978-3-031-42519-6_1.
  62. Llama 2: Open Foundation and Fine-Tuned Chat Models, arXiv preprint arXiv:2307.09288 (2023).
  63. Mistral 7B, arXiv preprint arXiv:2310.06825 (2023).
  64. Zephyr: Direct Distillation of LM Alignment, arXiv preprint arXiv:2310.16944 (2023).
  65. One Embedder, Any Task: Instruction-Finetuned Text Embeddings, arXiv preprint arXiv:2212.09741 (2022).
  66. B. Lin, Reinforcement learning and bandits for speech and language processing: Tutorial, review and outlook, Expert Systems with Applications 238 (2024) 122254.
  67. Sparks of Artificial General Intelligence: Early experiments with GPT-4, arXiv preprint arXiv:2303.12712 (2023).
  68. D. Gunning, D. W. Aha, DARPA’s Explainable Artificial Intelligence Program, AI Magazine 40 (2019) 44--58.
  69. Dissociating language and thought in large language models, arXiv preprint arXiv:2301.06627 (2023).

Summary

  • The paper introduces a dual-component system that uses blockchain for immutable event logging and LLMs for generating natural language explanations.
  • The methodology rigorously evaluates the system through ROS-based mobile robot scenarios, confirming effective traceability and clear accountability.
  • Experiments demonstrate that the integrated approach enhances trust in autonomous agents with minimal performance impact and resource overhead.

Enhancing Trust in Autonomous Agents through Blockchain and LLMs

Introduction

With autonomous agents increasingly coexisting with humans, the importance of ensuring their reliability and transparency has never been more apparent. Addressing security concerns integral to the deployment of these agents in human-centric environments necessitates a robust framework that not only guarantees the traceability of actions but also facilitates a deep understanding of these actions by non-expert users. In this regard, Laura Fernández-Becerra et al. propose an architecture leveraging blockchain technology for immutable logging and LLMs for natural language explanations. This combination aims to enhance both accountability and explainability of autonomous agents, focusing on ROS-based mobile robots.

System Architecture

The paper delineates a two-fold architecture: a black box-like component achieving accountability through blockchain technology, and a component responsible for generating natural language explanations utilizing LLMs. The black box component stores event logs with anti-tampering properties, ensuring data integrity and facilitating the assignment of responsibility in case of anomalies. Conversely, the explainability component leverages the data stored within the black box to generate understandable explanations for the robot's actions. An intermediate component processes and filters the raw data from the black box, refining the information for more effective explanation generation.

Experimental Setup and Evaluation

The evaluation of the proposed solution is rigorous and thorough, covering three different scenarios involving autonomous agent navigation functionalities. First, the accountability aspect was assessed across different methods of blockchain storage, examining the trade-offs between data granularity and system performance. The findings reveal a negligible impact on performance metrics such as CPU, RAM usage, and system load, validating the feasibility of integrating blockchain for this purpose. Subsequently, the effectiveness of the explainability component was explored through a multitude of questions, examining the coherence, accuracy, and helpfulness of the generated explanations.

Implications and Future Directions

The paper underscores the potential of utilizing blockchain and LLMs to address the dual challenge of accountability and explainability in autonomous agents. By providing a mechanism for immutable event logging and generating accessible explanations, this research paves the way for enhancing trust and safety in human-agent interactions. Future work might explore refining the retrieval mechanism of the RAG setup to improve answer relevancy and accuracy further. Additionally, extending the architecture's applicability beyond ROS-based mobile robots to other domains of autonomous agents represents an intriguing avenue for research.

Conclusion

This architecture represents a significant stride toward reconciling the opaqueness often associated with autonomous agents' functionalities. By ensuring actions are not only traceable but also interpretable to everyday users, the research contributes to demystifying the complexity of autonomous systems. The paper sets a precedent for leveraging advanced technologies like blockchain and LLMs in the field of robotics, highlighting their utility in fostering a safer and more transparent integration of autonomous agents into societal fabrics.