Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge (2012.06867v1)

Published 12 Dec 2020 in cs.SD, cs.LG, and eess.AS

Abstract: We held the second instaLLMent of the VoxCeleb Speaker Recognition Challenge in conjunction with Interspeech 2020. The goal of this challenge was to assess how well current speaker recognition technology is able to diarise and recognize speakers in unconstrained or `in the wild' data. It consisted of: (i) a publicly available speaker recognition and diarisation dataset from YouTube videos together with ground truth annotation and standardised evaluation software; and (ii) a virtual public challenge and workshop held at Interspeech 2020. This paper outlines the challenge, and describes the baselines, methods used, and results. We conclude with a discussion of the progress over the first instaLLMent of the challenge.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Arsha Nagrani (62 papers)
  2. Joon Son Chung (106 papers)
  3. Jaesung Huh (24 papers)
  4. Andrew Brown (31 papers)
  5. Ernesto Coto (4 papers)
  6. Weidi Xie (132 papers)
  7. Mitchell McLaren (11 papers)
  8. Andrew Zisserman (248 papers)
  9. Douglas A Reynolds (4 papers)
Citations (71)

Summary

We haven't generated a summary for this paper yet.