Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Unsupervised Slot Schema Induction for Task-oriented Dialog (2205.04515v1)

Published 9 May 2022 in cs.CL

Abstract: Carefully-designed schemas describing how to collect and annotate dialog corpora are a prerequisite towards building task-oriented dialog systems. In practical applications, manually designing schemas can be error-prone, laborious, iterative, and slow, especially when the schema is complicated. To alleviate this expensive and time consuming process, we propose an unsupervised approach for slot schema induction from unlabeled dialog corpora. Leveraging in-domain LLMs and unsupervised parsing structures, our data-driven approach extracts candidate slots without constraints, followed by coarse-to-fine clustering to induce slot types. We compare our method against several strong supervised baselines, and show significant performance improvement in slot schema induction on MultiWoz and SGD datasets. We also demonstrate the effectiveness of induced schemas on downstream applications including dialog state tracking and response generation.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Dian Yu (78 papers)
  2. Mingqiu Wang (20 papers)
  3. Yuan Cao (201 papers)
  4. Izhak Shafran (30 papers)
  5. Laurent El Shafey (15 papers)
  6. Hagen Soltau (19 papers)
Citations (13)

Summary

We haven't generated a summary for this paper yet.