Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

STOP: A dataset for Spoken Task Oriented Semantic Parsing (2207.10643v3)

Published 29 Jun 2022 in cs.CL, cs.AI, cs.SD, and eess.AS

Abstract: End-to-end spoken language understanding (SLU) predicts intent directly from audio using a single model. It promises to improve the performance of assistant systems by leveraging acoustic information lost in the intermediate textual representation and preventing cascading errors from Automatic Speech Recognition (ASR). Further, having one unified model has efficiency advantages when deploying assistant systems on-device. However, the limited number of public audio datasets with semantic parse labels hinders the research progress in this area. In this paper, we release the Spoken Task-Oriented semantic Parsing (STOP) dataset, the largest and most complex SLU dataset to be publicly available. Additionally, we define low-resource splits to establish a benchmark for improving SLU when limited labeled data is available. Furthermore, in addition to the human-recorded audio, we are releasing a TTS-generated version to benchmark the performance for low-resource domain adaptation of end-to-end SLU systems. Initial experimentation show end-to-end SLU models performing slightly worse than their cascaded counterparts, which we hope encourages future work in this direction.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (15)
  1. Paden Tomasello (17 papers)
  2. Akshat Shrivastava (25 papers)
  3. Daniel Lazar (7 papers)
  4. Duc Le (46 papers)
  5. Adithya Sagar (10 papers)
  6. Ali Elkahky (6 papers)
  7. Jade Copet (26 papers)
  8. Wei-Ning Hsu (76 papers)
  9. Yossi Adi (96 papers)
  10. Robin Algayres (14 papers)
  11. Tu Ahn Nguyen (1 paper)
  12. Emmanuel Dupoux (81 papers)
  13. Luke Zettlemoyer (225 papers)
  14. Abdelrahman Mohamed (59 papers)
  15. Po-chun Hsu (25 papers)
Citations (34)

Summary

We haven't generated a summary for this paper yet.