Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Slot Is Not Built in One Utterance: Spoken Language Dialogs with Sub-Slots (2203.10759v1)

Published 21 Mar 2022 in cs.CL, cs.AI, and cs.LG

Abstract: A slot value might be provided segment by segment over multiple-turn interactions in a dialog, especially for some important information such as phone numbers and names. It is a common phenomenon in daily life, but little attention has been paid to it in previous work. To fill the gap, this paper defines a new task named Sub-Slot based Task-Oriented Dialog (SSTOD) and builds a Chinese dialog dataset SSD for boosting research on SSTOD. The dataset includes a total of 40K dialogs and 500K utterances from four different domains: Chinese names, phone numbers, ID numbers and license plate numbers. The data is well annotated with sub-slot values, slot values, dialog states and actions. We find some new linguistic phenomena and interactive manners in SSTOD which raise critical challenges of building dialog agents for the task. We test three state-of-the-art dialog models on SSTOD and find they cannot handle the task well on any of the four domains. We also investigate an improved model by involving slot knowledge in a plug-in manner. More work should be done to meet the new challenges raised from SSTOD which widely exists in real-life applications. The dataset and code are publicly available via https://github.com/shunjiu/SSTOD.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Sai Zhang (18 papers)
  2. Yuwei Hu (15 papers)
  3. Yuchuan Wu (33 papers)
  4. Jiaman Wu (11 papers)
  5. Yongbin Li (128 papers)
  6. Jian Sun (415 papers)
  7. Caixia Yuan (13 papers)
  8. Xiaojie Wang (108 papers)
Citations (14)
Github Logo Streamline Icon: https://streamlinehq.com

GitHub