Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Explainable Behavior Cloning: Teaching Large Language Model Agents through Learning by Demonstration (2410.22916v1)

Published 30 Oct 2024 in cs.CL

Abstract: Autonomous mobile app interaction has become increasingly important with growing complexity of mobile applications. Developing intelligent agents that can effectively navigate and interact with mobile apps remains a significant challenge. In this paper, we propose an Explainable Behavior Cloning LLM Agent (EBC-LLMAgent), a novel approach that combines LLMs with behavior cloning by learning demonstrations to create intelligent and explainable agents for autonomous mobile app interaction. EBC-LLMAgent consists of three core modules: Demonstration Encoding, Code Generation, and UI Mapping, which work synergistically to capture user demonstrations, generate executable codes, and establish accurate correspondence between code and UI elements. We introduce the Behavior Cloning Chain Fusion technique to enhance the generalization capabilities of the agent. Extensive experiments on five popular mobile applications from diverse domains demonstrate the superior performance of EBC-LLMAgent, achieving high success rates in task completion, efficient generalization to unseen scenarios, and the generation of meaningful explanations.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Yanchu Guan (3 papers)
  2. Dong Wang (628 papers)
  3. Yan Wang (733 papers)
  4. Haiqing Wang (3 papers)
  5. Renen Sun (2 papers)
  6. Chenyi Zhuang (20 papers)
  7. Jinjie Gu (50 papers)
  8. Zhixuan Chu (43 papers)