Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems (2110.06800v3)

Published 13 Oct 2021 in cs.CL

Abstract: Zero/few-shot transfer to unseen services is a critical challenge in task-oriented dialogue research. The Schema-Guided Dialogue (SGD) dataset introduced a paradigm for enabling models to support any service in zero-shot through schemas, which describe service APIs to models in natural language. We explore the robustness of dialogue systems to linguistic variations in schemas by designing SGD-X - a benchmark extending SGD with semantically similar yet stylistically diverse variants for every schema. We observe that two top state tracking models fail to generalize well across schema variants, measured by joint goal accuracy and a novel metric for measuring schema sensitivity. Additionally, we present a simple model-agnostic data augmentation method to improve schema robustness.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Harrison Lee (8 papers)
  2. Raghav Gupta (24 papers)
  3. Abhinav Rastogi (29 papers)
  4. Yuan Cao (201 papers)
  5. Bin Zhang (227 papers)
  6. Yonghui Wu (115 papers)
Citations (32)