Tied Probabilistic Linear Discriminant Analysis for Speech Recognition

11/04/2014
by   Liang Lu, et al.
0

Acoustic models using probabilistic linear discriminant analysis (PLDA) capture the correlations within feature vectors using subspaces which do not vastly expand the model. This allows high dimensional and correlated feature spaces to be used, without requiring the estimation of multiple high dimension covariance matrices. In this letter we extend the recently presented PLDA mixture model for speech recognition through a tied PLDA approach, which is better able to control the model size to avoid overfitting. We carried out experiments using the Switchboard corpus, with both mel frequency cepstral coefficient features and bottleneck feature derived from a deep neural network. Reductions in word error rate were obtained by using tied PLDA, compared with the PLDA mixture model, subspace Gaussian mixture models, and deep neural networks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/18/2017

Deep Gaussian Mixture Models

Deep learning is a hierarchical inference method formed by subsequent mu...
research
10/18/2016

Small-footprint Highway Deep Neural Networks for Speech Recognition

State-of-the-art speech recognition systems typically employ neural netw...
research
02/24/2016

Accent Classification with Phonetic Vowel Representation

Previous accent classification research focused mainly on detecting acce...
research
02/24/2016

Improved Accent Classification Combining Phonetic Vowels with Acoustic Features

Researches have shown accent classification can be improved by integrati...
research
05/03/2018

Supervector Compression Strategies to Speed up I-Vector System Development

The front-end factor analysis (FEFA), an extension of principal componen...
research
10/18/2016

Low-rank and Sparse Soft Targets to Learn Better DNN Acoustic Models

Conventional deep neural networks (DNN) for speech acoustic modeling rel...
research
12/25/2017

Leveraging Native Language Speech for Accent Identification using Deep Siamese Networks

The problem of automatic accent identification is important for several ...

Please sign up or login with your details

Forgot password? Click here to reset