
MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation (2308.08239v2)

Published 16 Aug 2023 in cs.CL

Abstract: We propose MemoChat, a pipeline for refining instructions that enables LLMs to effectively employ self-composed memos for maintaining consistent long-range open-domain conversations. We demonstrate a long-range open-domain conversation through iterative "memorization-retrieval-response" cycles. This requires us to carefully design tailored tuning instructions for each distinct stage. The instructions are reconstructed from a collection of public datasets to teach the LLMs to memorize and retrieve past dialogues with structured memos, leading to enhanced consistency when participating in future conversations. We invite experts to manually annotate a test set designed to evaluate the consistency of long-range conversation questions. Experiments on three testing scenarios involving both open-source and API-accessible chatbots at scale verify the efficacy of MemoChat, which outperforms strong baselines. Our code, data, and models are available here: https://github.com/LuJunru/MemoChat.
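The iterative "memorization-retrieval-response" cycle described in the abstract can be sketched as follows. This is a minimal illustrative sketch, not MemoChat's actual implementation: in the paper, a tuned LLM composes the memos and performs retrieval itself, whereas here the memo schema (a topic plus stored dialogue turns) and the keyword-overlap retrieval are simplified assumptions.

```python
def memorize(memos, turns, topic):
    """Memorization stage: condense finished dialogue turns into a
    structured memo keyed by a topic summary (schema is an assumption)."""
    memos.append({"topic": topic, "turns": list(turns)})

def retrieve(memos, query):
    """Retrieval stage: pick the memo whose topic shares the most words
    with the new query (stand-in for the LLM's tuned retrieval)."""
    query_words = set(query.lower().split())
    best, best_score = None, 0
    for memo in memos:
        score = len(query_words & set(memo["topic"].lower().split()))
        if score > best_score:
            best, best_score = memo, score
    return best

def respond(memos, user_message, generate):
    """Response stage: answer the new message, grounding the generator on
    the retrieved memo's turns so long-range consistency is preserved."""
    memo = retrieve(memos, user_message)
    context = memo["turns"] if memo else []
    return generate(context, user_message)
```

A conversation then alternates these stages: after each dialogue segment, `memorize` records it; each new user turn triggers `retrieve` and `respond`, so earlier facts stay available far beyond the model's context window.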

Authors (8)
  1. Junru Lu (15 papers)
  2. Siyu An (9 papers)
  3. Mingbao Lin (78 papers)
  4. Gabriele Pergola (26 papers)
  5. Yulan He (113 papers)
  6. Di Yin (26 papers)
  7. Xing Sun (93 papers)
  8. Yunsheng Wu (25 papers)
Citations (22)