Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Tone Recognition Using Lifters and CTC (1807.02465v1)

Published 6 Jul 2018 in eess.AS and cs.SD

Abstract: In this paper, we present a new method for recognizing tones in continuous speech for tonal languages. The method works by converting the speech signal to a cepstrogram, extracting a sequence of cepstral features using a convolutional neural network, and predicting the underlying sequence of tones using a connectionist temporal classification (CTC) network. The performance of the proposed method is evaluated on a freely available Mandarin Chinese speech corpus, AISHELL-1, and is shown to outperform the existing techniques in the literature in terms of tone error rate (TER).

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Loren Lugosch (13 papers)
  2. Vikrant Singh Tomar (7 papers)
Citations (9)

Summary

We haven't generated a summary for this paper yet.