A Controllable Model of Grounded Response Generation (2005.00613v2)
Abstract: Current end-to-end neural conversation models inherently lack the flexibility to impose semantic control in the response generation process, often resulting in uninteresting responses. Attempts to boost informativeness alone come at the expense of factual accuracy, as attested by pretrained LLMs' propensity to "hallucinate" facts. While this may be mitigated by access to background knowledge, there is scant guarantee of relevance and informativeness in generated responses. We propose a framework that we call controllable grounded response generation (CGRG), in which lexical control phrases are either provided by a user or automatically extracted by a control phrase predictor from dialogue context and grounding knowledge. Quantitative and qualitative results show that, using this framework, a transformer-based model with a novel inductive attention mechanism, trained on a conversation-like Reddit dataset, outperforms strong generation baselines.
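The abstract describes a generator whose input concatenates dialogue context, control phrases, and grounding knowledge, with an inductive attention mechanism that prunes uninformative attention links. The sketch below illustrates one plausible way to build such a pruned attention mask; it is not the authors' code, and the segment ordering, the `links` structure, and the function name `inductive_attention_mask` are assumptions made for illustration.

```python
# Minimal sketch (not the paper's implementation): constructing an "inductive"
# attention mask for a GPT-2-style decoder whose input is the concatenation
#   [dialogue context | control phrases | grounding | response].
# The idea is to keep the usual causal mask but drop attention links between
# grounding tokens and control phrases that are unrelated to that grounding
# segment. Which segments relate to which phrases (`links`) is assumed given.

import torch


def inductive_attention_mask(ctx_len, cp_lens, gr_lens, resp_len, links):
    """Return a boolean mask M where M[i, j] = True allows token i to attend to j.

    ctx_len:  number of dialogue-context tokens
    cp_lens:  token lengths of each control phrase
    gr_lens:  token lengths of each grounding segment
    resp_len: number of response tokens
    links:    set of (cp_index, gr_index) pairs marking related phrase/grounding pairs
    """
    total = ctx_len + sum(cp_lens) + sum(gr_lens) + resp_len
    causal = torch.ones(total, total).tril().bool()  # standard causal (left-to-right) mask

    # Compute token index ranges for each control phrase and grounding segment.
    pos = ctx_len
    cp_spans = []
    for l in cp_lens:
        cp_spans.append((pos, pos + l))
        pos += l
    gr_spans = []
    for l in gr_lens:
        gr_spans.append((pos, pos + l))
        pos += l

    allowed = causal.clone()
    # Prune links: grounding tokens ignore control phrases they are not paired with.
    for gi, (gs, ge) in enumerate(gr_spans):
        for ci, (cs, ce) in enumerate(cp_spans):
            if (ci, gi) not in links:
                allowed[gs:ge, cs:ce] = False

    return allowed


# Example usage with toy segment lengths: two control phrases, two grounding
# segments, and only phrase 0 linked to grounding 1.
mask = inductive_attention_mask(ctx_len=5, cp_lens=[3, 2], gr_lens=[4, 4],
                                resp_len=6, links={(0, 1)})
print(mask.shape)  # torch.Size([24, 24])
```

Such a mask would be passed to the decoder's self-attention in place of the plain causal mask, so that generated tokens are steered toward grounding content that actually supports the supplied control phrases.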
- Zeqiu Wu (15 papers)
- Michel Galley (50 papers)
- Chris Brockett (37 papers)
- Yizhe Zhang (127 papers)
- Xiang Gao (210 papers)
- Chris Quirk (11 papers)
- Rik Koncel-Kedziorski (19 papers)
- Jianfeng Gao (344 papers)
- Hannaneh Hajishirzi (176 papers)
- Mari Ostendorf (57 papers)
- Bill Dolan (45 papers)