Investigation of Deep Neural Network Acoustic Modelling Approaches for Low Resource Accented Mandarin Speech Recognition

01/24/2022
by   Xurong Xie, et al.
0

The Mandarin Chinese language is known to be strongly influenced by a rich set of regional accents, while Mandarin speech with each accent is quite low resource. Hence, an important task in Mandarin speech recognition is to appropriately model the acoustic variabilities imposed by accents. In this paper, an investigation of implicit and explicit use of accent information on a range of deep neural network (DNN) based acoustic modelling techniques is conducted. Meanwhile, approaches of multi-accent modelling including multi-style training, multi-accent decision tree state tying, DNN tandem and multi-level adaptive network (MLAN) tandem hidden Markov model (HMM) modelling are combined and compared in this paper. On a low resource accented Mandarin speech recognition task consisting of four regional accents, an improved MLAN tandem HMM systems explicitly leveraging the accent information was proposed and significantly outperformed the baseline accent independent DNN tandem systems by 0.8 sequence level discriminative training and adaptation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/11/2016

Environmental Noise Embeddings for Robust Speech Recognition

We propose a novel deep neural network architecture for speech recogniti...
research
08/02/2016

Knowledge Distillation for Small-footprint Highway Networks

Deep learning has significantly advanced state-of-the-art of speech reco...
research
01/22/2016

Exploiting Low-dimensional Structures to Enhance DNN Based Acoustic Modeling in Speech Recognition

We propose to model the acoustic space of deep neural network (DNN) clas...
research
06/19/2018

A Survey of Recent DNN Architectures on the TIMIT Phone Recognition Task

In this survey paper, we have evaluated several recent deep neural netwo...
research
10/18/2016

Low-rank and Sparse Soft Targets to Learn Better DNN Acoustic Models

Conventional deep neural networks (DNN) for speech acoustic modeling rel...
research
07/01/2016

Moving Toward High Precision Dynamical Modelling in Hidden Markov Models

Hidden Markov Model (HMM) is often regarded as the dynamical model of ch...

Please sign up or login with your details

Forgot password? Click here to reset