Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Cantonese Automatic Speech Recognition Using Transfer Learning from Mandarin (1911.09271v1)

Published 21 Nov 2019 in cs.CL

Abstract: We propose a system to develop a basic automatic speech recognizer(ASR) for Cantonese, a low-resource language, through transfer learning of Mandarin, a high-resource language. We take a time-delayed neural network trained on Mandarin, and perform weight transfer of several layers to a newly initialized model for Cantonese. We experiment with the number of layers transferred, their learning rates, and pretraining i-vectors. Key findings are that this approach allows for quicker training time with less data. We find that for every epoch, log-probability is smaller for transfer learning models compared to a Cantonese-only model. The transfer learning models show slight improvement in CER.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Bryan Li (17 papers)
  2. Xinyue Wang (29 papers)
  3. Homayoon Beigi (16 papers)
Citations (7)

Summary

We haven't generated a summary for this paper yet.