
GODEL: Large-Scale Pre-Training for Goal-Directed Dialog (2206.11309v1)

Published 22 Jun 2022 in cs.CL

Abstract: We introduce GODEL (Grounded Open Dialogue Language Model), a large pre-trained language model for dialog. In contrast with earlier models such as DialoGPT, GODEL leverages a new phase of grounded pre-training designed to better support adapting GODEL to a wide range of downstream dialog tasks that require information external to the current conversation (e.g., a database or document) to produce good responses. Experiments against an array of benchmarks that encompass task-oriented dialog, conversational QA, and grounded open-domain dialog show that GODEL outperforms state-of-the-art pre-trained dialog models in few-shot fine-tuning setups, in terms of both human and automatic evaluation. A novel feature of our evaluation methodology is the introduction of a notion of utility that assesses the usefulness of responses (extrinsic evaluation) in addition to their communicative features (intrinsic evaluation). We show that extrinsic evaluation offers improved inter-annotator agreement and correlation with automated metrics. Code and data processing scripts are publicly available.
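The grounded pre-training phase conditions response generation on text external to the conversation (e.g., a retrieved document) alongside the dialog history. As a concrete illustration, here is a minimal sketch of prompting a publicly released GODEL checkpoint; the model id `microsoft/GODEL-v1_1-base-seq2seq` and the `[CONTEXT]`/`[KNOWLEDGE]` prompt layout follow the public Hugging Face model card rather than anything stated in the abstract, so treat them as assumptions.

```python
# Minimal sketch of grounded response generation with a released GODEL
# checkpoint. The model id and the "[CONTEXT] ... [KNOWLEDGE] ..." prompt
# layout follow the public Hugging Face model card (an assumption here,
# not something stated in the abstract).
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

MODEL_ID = "microsoft/GODEL-v1_1-base-seq2seq"  # assumed public checkpoint

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_ID)

def generate_grounded(instruction: str, dialog: list[str], knowledge: str) -> str:
    """Generate a response conditioned on dialog history plus external grounding."""
    context = " EOS ".join(dialog)  # turns joined with an EOS marker, per the card
    prompt = f"{instruction} [CONTEXT] {context} [KNOWLEDGE] {knowledge}"
    input_ids = tokenizer(prompt, return_tensors="pt").input_ids
    output_ids = model.generate(input_ids, max_length=128, num_beams=5)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)

print(generate_grounded(
    instruction=("Instruction: given a dialog context and related knowledge, "
                 "you need to respond based on the knowledge."),
    dialog=["Does money buy happiness?",
            "It depends on how you spend it."],
    knowledge=("Research suggests spending money on experiences rather than "
               "possessions is more strongly linked to happiness."),
))
```

Because GODEL is a sequence-to-sequence model, the grounding text is simply concatenated into the encoder input; the same prompt shape then supports few-shot fine-tuning across task-oriented dialog, conversational QA, and grounded open-domain dialog.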

Authors (9)
  1. Baolin Peng
  2. Michel Galley
  3. Pengcheng He
  4. Chris Brockett
  5. Lars Liden
  6. Elnaz Nouri
  7. Zhou Yu
  8. Bill Dolan
  9. Jianfeng Gao
Citations (67)