From LLM to Conversational Agent: A Memory Enhanced Architecture with Fine-Tuning of Large Language Models (2401.02777v2)
Abstract: This paper introduces RAISE (Reasoning and Acting through Scratchpad and Examples), an advanced architecture enhancing the integration of LLMs like GPT-4 into conversational agents. RAISE, an enhancement of the ReAct framework, incorporates a dual-component memory system, mirroring human short-term and long-term memory, to maintain context and continuity in conversations. It entails a comprehensive agent construction scenario, including phases like Conversation Selection, Scene Extraction, CoT Completion, and Scene Augmentation, leading to the LLMs Training phase. This approach appears to enhance agent controllability and adaptability in complex, multi-turn dialogues. Our preliminary evaluations in a real estate sales context suggest that RAISE has some advantages over traditional agents, indicating its potential for broader applications. This work contributes to the AI field by providing a robust framework for developing more context-aware and versatile conversational agents.
- 2023. Qwen technical report. arXiv preprint arXiv:2309.16609.
- 2009. Defining agency: Individuality, normativity, asymmetry, and spatio-temporality in action. Adaptive Behavior, 17(5):367–386.
- 2023. Sparks of artificial general intelligence: Early experiments with gpt-4. CoRR. arXiv:2303.12712.
- 2023. Dialogue chain-of-thought distillation for commonsense-aware conversational agents. arXiv preprint arXiv:2310.09343.
- 2023a. Fireact: Toward language agent fine-tuning. arXiv preprint arXiv:2310.05915.
- 2023b. T-eval: Evaluating the tool utilization capability step by step.
- 2023c. Chatcot: Tool-augmented chain-of-thought reasoning on\\\backslash\\\\backslash\chat-based large language models. arXiv preprint arXiv:2305.14323.
- 2023. Agent instructs large language models to be general zero-shot reasoners. ArXiv, abs/2310.03710.
- 2023. Zero-shot goal-directed dialogue via rl on imagined conversations. arXiv preprint arXiv:2311.05584.
- 2023. Tptu-v2: Boosting task planning and tool usage of large language model-based agents in real-world systems. arXiv preprint arXiv:2311.11315.
- 2023. Apibank: A benchmark for tool-augmented llms. arXiv preprint.
- 2023. Agentbench: Evaluating llms as agents. arXiv preprint arXiv:2308.03688.
- 2022. Webgpt: Browser-assisted question-answering with human feedback. arXiv preprint.
- 2023. Can generalist foundation models outcompete special-purpose tuning? case study in medicine. ArXiv, abs/2311.16452.
- OpenAI. 2023a. Chatgpt: Optimizing language models for dialogue. Blog post.
- OpenAI. 2023b. Gpt-4 technical report. Blog post.
- 2022. Training language models to follow instructions with human feedback.
- 2023. Kwaiagents: Generalized information-seeking agent system with large language models. arXiv preprint arXiv:2312.04889.
- 2023. Tptu: Task planning and tool usage of large language model-based ai agents. arXiv preprint arXiv:2308.03427.
- 2023. Toolformer: Language models can teach themselves to use tools. arXiv preprint.
- 2023. Character-llm: A trainable agent for role-playing. arXiv preprint arXiv:2310.10158.
- 2023. Hugginggpt: Solving ai tasks with chatgpt and its friends in huggingface. arXiv preprint arXiv:2303.17580.
- 2023. Cognitive architectures for language agents. arXiv preprint arXiv:2309.02427.
- 2023a. A survey on large language model based autonomous agents. arXiv preprint arXiv:2308.11432.
- 2023b. A survey on large language model based autonomous agents. ArXiv, abs/2308.11432.
- 2023c. Self-consistency improves chain of thought reasoning in language models. In Proceedings of ICLR.
- 2023d. Rolellm: Benchmarking, eliciting, and enhancing role-playing abilities of large language models. arXiv preprint arXiv:2310.00746.
- 2022a. Emergent abilities of large language models. Trans. Mach. Learn. Res.
- 2022b. Chain of thought prompting elicits reasoning in large language models. In Proceedings of NeurIPS.
- L. Weng. 2023. Llm-powered autonomous agents.
- 2023. The rise and potential of large language model based agents: A survey. arXiv preprint arXiv:2309.07864.
- 2023. Openagents: An open platform for language agents in the wild. arXiv preprint arXiv:2310.10634.
- 2022. React: Synergizing reasoning and acting in language models. arXiv preprint arXiv:2210.03629.
- 2023. React: Synergizing reasoning and acting in language models. arXiv preprint.
- 1995. Stanford encyclopedia of philosophy.
- 2023. Agenttuning: Enabling generalized agent abilities for llms. arXiv preprint arXiv:2310.12823.
- 2023. Expel: Llm agents are experiential learners. arXiv preprint arXiv:2308.10144.
- 2023. Least-to-most prompting enables complex reasoning in large language models. In Proceedings of ICLR.