
Neural Conversation Models and How to Rein Them in: A Survey of Failures and Fixes (2308.06095v1)

Published 11 Aug 2023 in cs.CL, cs.AI, and cs.LG

Abstract: Recent conditional LLMs are able to continue any kind of text source in an often seemingly fluent way. This fact encouraged research in the area of open-domain conversational systems that are based on powerful LLMs and aim to imitate an interlocutor by generating appropriate contributions to a written dialogue. From a linguistic perspective, however, the complexity of contributing to a conversation is high. In this survey, we interpret Grice's maxims of cooperative conversation from the perspective of this specific research area and systematize the literature according to what makes a contribution appropriate: A neural conversation model has to be fluent, informative, consistent, coherent, and follow social norms. In order to ensure these qualities, recent approaches try to tame the underlying LLMs at various intervention points, such as the data, the training regime, or decoding. Sorted by these categories and intervention points, we discuss promising attempts and suggest novel ways for future research.
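To make the decoding intervention point concrete, here is a minimal illustrative sketch (not taken from the survey itself) of one widely used decoding-time fix: a repetition penalty applied to next-token logits before sampling, which targets the dull, repetitive responses that hurt informativeness. The function name and the penalty value are hypothetical; only NumPy is assumed.

```python
import numpy as np

def sample_with_repetition_penalty(logits, generated_ids, penalty=1.2, rng=None):
    """Sample the next token after penalizing tokens already generated.

    logits: 1-D array of next-token logits from any language model.
    generated_ids: token ids produced so far in the current response.
    penalty: values > 1.0 discourage repetition (CTRL-style penalty).
    """
    rng = rng or np.random.default_rng()
    logits = logits.astype(np.float64).copy()
    for tok in set(generated_ids):
        # Divide positive logits and multiply negative ones, so the
        # penalty always lowers the repeated token's probability.
        if logits[tok] > 0:
            logits[tok] /= penalty
        else:
            logits[tok] *= penalty
    # Numerically stable softmax, then sample from the adjusted distribution.
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    return int(rng.choice(len(logits), p=probs))
```

Analogous interventions at the other two points would modify the training data (e.g., filtering inconsistent dialogues) or the training regime (e.g., adding a consistency objective) rather than the sampling step.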
