
Prompting for a conversation: How to control a dialog model? (2209.11068v1)

Published 22 Sep 2022 in cs.CL

Abstract: Dialog modelling faces a difficult trade-off. Models are trained on large amounts of text, yet their responses need to be limited to the desired scope and style of a dialog agent. Because the datasets used to achieve the former contain language that is incompatible with the latter, pre-trained dialog models are fine-tuned on smaller curated datasets. However, the fine-tuning process robs them of the ability to produce diverse responses, eventually reducing them to dull conversation partners. In this paper we investigate whether prompting can mitigate this trade-off. Specifically, we experiment with conditioning the prompt on the query, rather than training a single prompt for all queries. Following the intuition that freezing the pre-trained LLM will conserve its expressivity, we find that compared to fine-tuning, prompting can achieve a higher BLEU score and substantially improve the diversity and novelty of the responses.
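
The core idea in the abstract, tuning a query-conditioned soft prompt while keeping the pre-trained language model frozen, can be sketched roughly as below. This is a minimal illustration under assumptions made here for concreteness, not the paper's implementation: GPT-2 as the backbone, a single linear projection from the mean-pooled query embedding to the soft prompt, a prompt length of 10, and loss computed only on response tokens are all choices of this sketch.

```python
# Sketch of query-conditioned prompt tuning with a frozen LM (assumptions
# noted above; the paper's exact architecture and training setup may differ).
import torch
import torch.nn as nn
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token
lm = GPT2LMHeadModel.from_pretrained("gpt2")
for p in lm.parameters():          # freeze the pre-trained LM entirely
    p.requires_grad = False

PROMPT_LEN, EMB = 10, lm.config.n_embd  # hypothetical prompt length

class QueryConditionedPrompt(nn.Module):
    """Maps a pooled query encoding to PROMPT_LEN soft prompt vectors,
    so the prompt depends on the query instead of being shared."""
    def __init__(self):
        super().__init__()
        self.proj = nn.Linear(EMB, PROMPT_LEN * EMB)

    def forward(self, query_embeds):               # (B, Tq, EMB)
        pooled = query_embeds.mean(dim=1)          # (B, EMB)
        return self.proj(pooled).view(-1, PROMPT_LEN, EMB)

prompt_net = QueryConditionedPrompt()              # only trainable part
optimizer = torch.optim.AdamW(prompt_net.parameters(), lr=1e-4)

def training_step(query, response):
    q = tokenizer(query, return_tensors="pt")
    r = tokenizer(response, return_tensors="pt")
    q_emb = lm.transformer.wte(q.input_ids)        # frozen token embeddings
    r_emb = lm.transformer.wte(r.input_ids)
    soft_prompt = prompt_net(q_emb)                # conditioned on the query
    inputs = torch.cat([soft_prompt, q_emb, r_emb], dim=1)
    # Mask prompt and query positions; only response tokens carry loss.
    ignore = torch.full((1, PROMPT_LEN + q.input_ids.size(1)), -100)
    labels = torch.cat([ignore, r.input_ids], dim=1)
    loss = lm(inputs_embeds=inputs, labels=labels).loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
    return loss.item()

print(training_step("How are you today?", "Doing great, thanks for asking!"))
```

Because gradients flow only into `prompt_net`, the frozen LM's output distribution is left intact, which matches the abstract's intuition that freezing the backbone conserves its expressivity.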

Authors (3)
  1. Josef Valvoda (18 papers)
  2. Yimai Fang (4 papers)
  3. David Vandyke (18 papers)
Citations (5)