Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability (1906.07601v1)

Published 18 Jun 2019 in cs.CL, cs.SD, and eess.AS

Abstract: We present an end-to-end approach to extract semantic concepts directly from the speech audio signal. To overcome the lack of data available for this spoken language understanding approach, we investigate the use of a transfer learning strategy based on the principles of curriculum learning. This approach allows us to exploit out-of-domain data that can help to prepare a fully neural architecture. Experiments are carried out on the French MEDIA and PORTMEDIA corpora and show that this end-to-end SLU approach reaches the best results ever published on this task. We compare our approach to a classical pipeline approach that uses ASR, POS tagging, lemmatizer, chunker... and other NLP tools that aim to enrich ASR outputs that feed an SLU text to concepts system. Last, we explore the promising capacity of our end-to-end SLU approach to address the problem of domain portability.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Antoine Caubrière (9 papers)
  2. Natalia Tomashenko (32 papers)
  3. Antoine Laurent (22 papers)
  4. Emmanuel Morin (13 papers)
  5. Nathalie Camelin (4 papers)
  6. Yannick Estève (45 papers)
Citations (54)

Summary

We haven't generated a summary for this paper yet.