Knowledge Distillation for Small-footprint Highway Networks

08/02/2016
by Liang Lu, et al.

Deep learning has significantly advanced the state of the art in speech recognition over the past few years. However, compared to conventional Gaussian mixture acoustic models, neural network models are usually much larger, and are therefore difficult to deploy on embedded devices. Previously, we investigated a compact highway deep neural network (HDNN) for acoustic modelling, which is a type of depth-gated feedforward neural network. We have shown that HDNN-based acoustic models can achieve comparable recognition accuracy with a much smaller number of model parameters than plain deep neural network (DNN) acoustic models. In this paper, we push the boundary further by leveraging the knowledge distillation technique, also known as teacher-student training: we train the compact HDNN model under the supervision of a high-accuracy but cumbersome model. Furthermore, we investigate sequence training and adaptation in the context of teacher-student training. Our experiments were performed on the AMI meeting speech recognition corpus. With this technique, we significantly improved the recognition accuracy of the HDNN acoustic model with fewer than 0.8 million parameters, and narrowed the gap between this model and the plain DNN with 30 million parameters.
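
As a rough illustration of the teacher-student idea described above, the following is a minimal sketch of a commonly used frame-level distillation loss. It is not taken from the paper: the temperature, the interpolation weight `alpha`, and the function names are all assumptions made for illustration, and the paper's actual training recipe may differ.

```python
import numpy as np

def softmax(logits, temperature=1.0):
    """Row-wise softmax with an optional temperature (assumed hyperparameter)."""
    z = logits / temperature
    z = z - z.max(axis=-1, keepdims=True)  # subtract the row max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_logits, hard_labels,
                      temperature=2.0, alpha=0.5):
    """Hypothetical teacher-student loss for one batch of frames.

    Interpolates the cross entropy against the teacher's softened posteriors
    with the usual cross entropy against the hard labels. Both `temperature`
    and `alpha` are illustrative values, not settings from the paper.
    """
    eps = 1e-12  # avoids log(0)
    teacher_soft = softmax(teacher_logits, temperature)
    student_soft = softmax(student_logits, temperature)
    # Cross entropy between the teacher's and the student's softened outputs.
    soft_ce = -(teacher_soft * np.log(student_soft + eps)).sum(axis=-1).mean()
    # Standard cross entropy against the hard (forced-alignment style) labels.
    student_probs = softmax(student_logits)
    picked = student_probs[np.arange(len(hard_labels)), hard_labels]
    hard_ce = -np.log(picked + eps).mean()
    return alpha * soft_ce + (1.0 - alpha) * hard_ce
```

In this sketch, `teacher_logits` would come from the large model and `student_logits` from the compact one; setting `alpha=1.0` trains the student on the teacher's soft targets alone.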


research
10/18/2016

Small-footprint Highway Deep Neural Networks for Speech Recognition

State-of-the-art speech recognition systems typically employ neural netw...
research
12/14/2015

Small-footprint Deep Neural Networks with Highway Connections for Speech Recognition

For speech recognition, deep neural networks (DNNs) have significantly i...
research
07/07/2016

Sequence Training and Adaptation of Highway Deep Neural Networks

Highway deep neural network (HDNN) is a type of depth-gated feedforward ...
research
01/24/2022

Investigation of Deep Neural Network Acoustic Modelling Approaches for Low Resource Accented Mandarin Speech Recognition

The Mandarin Chinese language is known to be strongly influenced by a ri...
research
03/25/2022

Impact of Dataset on Acoustic Models for Automatic Speech Recognition

In Automatic Speech Recognition, GMM-HMM had been widely used for acoust...
research
07/01/2019

Compression of Acoustic Event Detection Models With Quantized Distillation

Acoustic Event Detection (AED), aiming at detecting categories of events...
research
06/28/2017

Toward Computation and Memory Efficient Neural Network Acoustic Models with Binary Weights and Activations

Neural network acoustic models have significantly advanced state of the ...
