Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

An Empirical Evaluation of Zero Resource Acoustic Unit Discovery (1702.01360v1)

Published 5 Feb 2017 in cs.CL

Abstract: Acoustic unit discovery (AUD) is a process of automatically identifying a categorical acoustic unit inventory from speech and producing corresponding acoustic unit tokenizations. AUD provides an important avenue for unsupervised acoustic model training in a zero resource setting where expert-provided linguistic knowledge and transcribed speech are unavailable. Therefore, to further facilitate zero-resource AUD process, in this paper, we demonstrate acoustic feature representations can be significantly improved by (i) performing linear discriminant analysis (LDA) in an unsupervised self-trained fashion, and (ii) leveraging resources of other languages through building a multilingual bottleneck (BN) feature extractor to give effective cross-lingual generalization. Moreover, we perform comprehensive evaluations of AUD efficacy on multiple downstream speech applications, and their correlated performance suggests that AUD evaluations are feasible using different alternative language resources when only a subset of these evaluation resources can be available in typical zero resource applications.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (10)
  1. Chunxi Liu (20 papers)
  2. Jinyi Yang (141 papers)
  3. Ming Sun (146 papers)
  4. Santosh Kesiraju (12 papers)
  5. Alena Rott (1 paper)
  6. Lucas Ondel (13 papers)
  7. Pegah Ghahremani (3 papers)
  8. Najim Dehak (71 papers)
  9. Lukas Burget (164 papers)
  10. Sanjeev Khudanpur (74 papers)
Citations (14)

Summary

We haven't generated a summary for this paper yet.