Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

The Zero Resource Speech Challenge 2021: Spoken language modelling (2104.14700v2)

Published 29 Apr 2021 in cs.CL and cs.AI

Abstract: We present the Zero Resource Speech Challenge 2021, which asks participants to learn a LLM directly from audio, without any text or labels. The challenge is based on the Libri-light dataset, which provides up to 60k hours of audio from English audio books without any associated text. We provide a pipeline baseline system consisting on an encoder based on contrastive predictive coding (CPC), a quantizer ($k$-means) and a standard LLM (BERT or LSTM). The metrics evaluate the learned representations at the acoustic (ABX discrimination), lexical (spot-the-word), syntactic (acceptability judgment) and semantic levels (similarity judgment). We present an overview of the eight submitted systems from four groups and discuss the main results.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Ewan Dunbar (22 papers)
  2. Mathieu Bernard (10 papers)
  3. Nicolas Hamilakis (3 papers)
  4. Tu Anh Nguyen (12 papers)
  5. Maureen de Seyssel (11 papers)
  6. Morgane Rivière (26 papers)
  7. Eugene Kharitonov (25 papers)
  8. Emmanuel Dupoux (81 papers)
  9. Patricia Rozé (2 papers)
Citations (42)

Summary

We haven't generated a summary for this paper yet.