Gated Recurrent Unit Based Acoustic Modeling with Future Context

05/18/2018
by Jie Li, et al.

Future contextual information has generally been shown to help acoustic modeling. For recurrent neural networks (RNNs), however, it is difficult to model future temporal context effectively while keeping model latency low. In this paper, we attempt to design an RNN acoustic model that can use future context effectively and directly, with model latency and computation cost kept as low as possible. The proposed model is based on the minimal gated recurrent unit (mGRU) with an input projection layer inserted into it. Two context modules, temporal encoding and temporal convolution, are designed specifically for this architecture to model future context. Experimental results on the Switchboard task and an internal Mandarin ASR task show that the proposed model performs much better than long short-term memory (LSTM) and mGRU models while still enabling online decoding with a maximum latency of 170 ms. The model even outperforms a very strong baseline, TDNN-LSTM, with lower model latency and almost half as many parameters.
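As a rough illustration of the kind of recurrent cell the abstract describes, the sketch below runs a single-gate recurrent unit with an input projection over a short sequence. The abstract does not give the equations, so everything specific here is an assumption: the single update gate, the ReLU candidate state, the shared linear projection of the concatenated input and previous state, the parameter names (`W_v`, `W_z`, `W_h`), and the toy dimensions are all hypothetical. The temporal encoding and temporal convolution modules are not modeled.

```python
import numpy as np


def sigmoid(a):
    """Elementwise logistic function."""
    return 1.0 / (1.0 + np.exp(-a))


def init_params(input_dim, hidden_dim, proj_dim, seed=0):
    """Create small random weights (hypothetical names and shapes)."""
    rng = np.random.default_rng(seed)
    scale = 0.1
    return {
        # Projects [x_t; h_{t-1}] down to proj_dim (the assumed "input projection").
        "W_v": rng.normal(0.0, scale, (proj_dim, input_dim + hidden_dim)),
        # Update gate weights, applied to the projected vector.
        "W_z": rng.normal(0.0, scale, (hidden_dim, proj_dim)),
        "b_z": np.zeros(hidden_dim),
        # Candidate-state weights, applied to the same projected vector.
        "W_h": rng.normal(0.0, scale, (hidden_dim, proj_dim)),
        "b_h": np.zeros(hidden_dim),
    }


def mgru_ip_step(x_t, h_prev, p):
    """One time step of the assumed single-gate cell with input projection."""
    v_t = p["W_v"] @ np.concatenate([x_t, h_prev])       # low-dimensional projection
    z_t = sigmoid(p["W_z"] @ v_t + p["b_z"])             # update gate in (0, 1)
    h_cand = np.maximum(0.0, p["W_h"] @ v_t + p["b_h"])  # ReLU candidate state
    return z_t * h_prev + (1.0 - z_t) * h_cand           # gated interpolation


def run_sequence(xs, p, hidden_dim):
    """Run the cell over a (T, input_dim) array, starting from a zero state."""
    h = np.zeros(hidden_dim)
    outputs = []
    for x_t in xs:
        h = mgru_ip_step(x_t, h, p)
        outputs.append(h)
    return np.stack(outputs)


if __name__ == "__main__":
    params = init_params(input_dim=4, hidden_dim=6, proj_dim=3)
    frames = np.ones((5, 4))  # five dummy feature frames
    states = run_sequence(frames, params, hidden_dim=6)
    print(states.shape)
```

In this sketch the gate and candidate matrices act on the `proj_dim`-sized vector rather than on the full concatenation of input and state, so a small `proj_dim` reduces the per-layer weight count. Whether this matches the parameter savings reported in the abstract is not something the abstract itself confirms.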


