Tone Recognition Using Lifters and CTC

07/06/2018
by   Loren Lugosch, et al.
0

In this paper, we present a new method for recognizing tones in continuous speech for tonal languages. The method works by converting the speech signal to a cepstrogram, extracting a sequence of cepstral features using a convolutional neural network, and predicting the underlying sequence of tones using a connectionist temporal classification (CTC) network. The performance of the proposed method is evaluated on a freely available Mandarin Chinese speech corpus, AISHELL-1, and is shown to outperform the existing techniques in the literature in terms of tone error rate (TER).

READ FULL TEXT
research
06/17/2019

Speech Recognition With No Speech Or With Noisy Speech Beyond English

In this paper we demonstrate continuous noisy speech recognition using c...
research
11/15/2014

Definition of Visual Speech Element and Research on a Method of Extracting Feature Vector for Korean Lip-Reading

In this paper, we defined the viseme (visual speech element) and describ...
research
12/22/2014

Learning linearly separable features for speech recognition using convolutional neural networks

Automatic speech recognition systems usually rely on spectral-based feat...
research
03/30/2018

Detecting Alzheimer's Disease Using Gated Convolutional Neural Network from Audio Data

We propose an automatic detection method of Alzheimer's diseases using a...
research
06/05/2020

A New Method Towards Speech Files Local Features Investigation

There are a few reasons for the recent increased interest in the study o...
research
08/23/2019

Multilingual and Multimode Phone Recognition System for Indian Languages

The aim of this paper is to develop a flexible framework capable of auto...
research
04/16/2020

Speech Paralinguistic Approach for Detecting Dementia Using Gated Convolutional Neural Network

We propose a non-invasive and cost-effective method to automatically det...

Please sign up or login with your details

Forgot password? Click here to reset