Multilingual acoustic word embeddings for zero-resource languages (2401.10543v2)

Published 19 Jan 2024 in eess.AS, cs.CL, and cs.SD

Abstract: This research addresses the challenge of developing speech applications for zero-resource languages that lack labelled data. It specifically uses acoustic word embeddings (AWEs) -- fixed-dimensional representations of variable-duration speech segments -- employing multilingual transfer, where labelled data from several well-resourced languages are used for pretraining. The study introduces a new neural network that outperforms existing AWE models on zero-resource languages, and examines how the choice of well-resourced training languages affects performance. AWEs are applied to a keyword-spotting system for hate speech detection in Swahili radio broadcasts, demonstrating robustness in real-world scenarios. Additionally, novel semantic AWE models improve semantic query-by-example search.
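To make the AWE idea concrete: an embedding function maps a variable-duration sequence of speech features (e.g., MFCC frames) to a single fixed-dimensional vector, so two segments can be compared with one vector distance instead of dynamic time warping. The sketch below is a minimal illustration using the uniform-downsampling baseline common in the AWE literature, not the paper's neural model; the function names and the random stand-in "speech" data are hypothetical.

```python
import numpy as np

def downsample_embed(features: np.ndarray, n_keep: int = 10) -> np.ndarray:
    """Map a variable-length (T, D) feature sequence to a fixed
    (n_keep * D)-dimensional vector by uniformly sampling n_keep frames.
    This is the classic downsampling AWE baseline, not the neural
    model proposed in the paper."""
    T, _ = features.shape
    idx = np.linspace(0, T - 1, n_keep).round().astype(int)
    return features[idx].ravel()

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity between two embeddings."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-8))

# Toy query-by-example search: embed a spoken query once, then rank
# candidate segments by a single vector comparison each (no DTW).
# Random arrays stand in for MFCC sequences of different durations.
rng = np.random.default_rng(0)
query = downsample_embed(rng.normal(size=(63, 13)))  # 63 frames, 13 MFCCs
candidates = [rng.normal(size=(t, 13)) for t in (30, 58, 91, 120)]
scores = [cosine(query, downsample_embed(c)) for c in candidates]
best = int(np.argmax(scores))
print(f"best match: segment {best}, score {scores[best]:.3f}")
```

A trained multilingual encoder, such as the one studied in the paper, would slot in where downsample_embed is used, leaving the search recipe unchanged.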

Authors (1)
  1. Christiaan Jacobs (7 papers)
