Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

GIST-AiTeR System for the Diarization Task of the 2022 VoxCeleb Speaker Recognition Challenge (2209.10357v4)

Published 21 Sep 2022 in eess.AS

Abstract: This report describes the submission system of the GIST-AiTeR team at the 2022 VoxCeleb Speaker Recognition Challenge (VoxSRC) Track 4. Our system mainly includes speech enhancement, voice activity detection , multi-scaled speaker embedding, probabilistic linear discriminant analysis-based speaker clustering, and overlapped speech detection models. We first construct four different diarization systems according to different model combinations with the best experimental efforts. Our final submission is an ensemble system of all the four systems and achieves a diarization error rate of 5.12% on the challenge evaluation set, ranked third at the diarization track of the challenge.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Dongkeon Park (3 papers)
  2. Yechan Yu (3 papers)
  3. Kyeong Wan Park (1 paper)
  4. Ji Won Kim (5 papers)
  5. Hong Kook Kim (9 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.