
Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond (2310.05513v1)

Published 9 Oct 2023 in cs.SD, cs.CL, and eess.AS

Abstract: The 2023 Multilingual Speech Universal Performance Benchmark (ML-SUPERB) Challenge expands upon the acclaimed SUPERB framework, emphasizing self-supervised models in multilingual speech recognition and language identification. The challenge comprises a Research Track focused on applying ML-SUPERB to specific multilingual subjects, a Challenge Track for model submissions, and a New Language Track where language resource researchers can contribute and evaluate their low-resource language data in the context of the latest progress in multilingual speech recognition. The challenge garnered 12 model submissions and 54 language corpora, resulting in a comprehensive benchmark encompassing 154 languages. The findings indicate that merely scaling models is not the definitive solution for multilingual speech tasks, and that a variety of speech/voice types present significant challenges in multilingual speech processing.
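As context for the benchmark numbers: ML-SUPERB-style multilingual ASR results are reported as character error rate (CER), i.e. the Levenshtein edit distance between the reference and hypothesis character sequences, normalized by the reference length. A minimal stdlib-only sketch of that metric (an illustrative implementation, not the challenge's official scoring code):

```python
def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: edit distance over reference length."""
    ref, hyp = list(reference), list(hypothesis)
    # prev[j] holds the edit distance between ref[:i-1] and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,         # deletion
                            curr[j - 1] + 1,     # insertion
                            prev[j - 1] + cost)) # substitution
        prev = curr
    return prev[-1] / max(len(ref), 1)

# One deleted character against an 11-character reference: CER = 1/11
print(f"{cer('hello world', 'helo world'):.3f}")
```

Because CER operates on characters rather than words, it transfers across languages without requiring a consistent word-segmentation scheme, which is one reason it is the standard metric for massively multilingual ASR benchmarks.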

Authors (13)
  1. Jiatong Shi
  2. William Chen
  3. Dan Berrebbi
  4. Hsiu-Hsuan Wang
  5. Wei-Ping Huang
  6. En-Pei Hu
  7. Ho-Lam Chuang
  8. Xuankai Chang
  9. Yuxun Tang
  10. Shang-Wen Li
  11. Abdelrahman Mohamed
  12. Hung-yi Lee
  13. Shinji Watanabe
Citations (13)
