Comparison of SVD and factorized TDNN approaches for speech to text

10/13/2021
by   Jeffrey Josanne Michael, et al.
0

This work concentrates on reducing the RTF and word error rate of a hybrid HMM-DNN. Our baseline system uses an architecture with TDNN and LSTM layers. We find this architecture particularly useful for lightly reverberated environments. However, these models tend to demand more computation than is desirable. In this work, we explore alternate architectures employing singular value decomposition (SVD) is applied to the TDNN layers to reduce the RTF, as well as to the affine transforms of every LSTM cell. We compare this approach with specifying bottleneck layers similar to those introduced by SVD before training. Additionally, we reduced the search space of the decoding graph to make it a better fit to operate in real-time applications. We report -61.57 relative reduction in RTF and almost 1 architecture trained on Fisher data along with reverberated versions of this dataset in order to match one of our target test distributions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/15/2023

Deep Learning Weight Pruning with RMT-SVD: Increasing Accuracy and Reducing Overfitting

In this work, we present some applications of random matrix theory for t...
research
11/28/2018

SVD-PHAT: A Fast Sound Source Localization Method

This paper introduces a new localization method called SVD-PHAT. The SVD...
research
01/18/2021

Tiny Transducer: A Highly-efficient Speech Recognition Model on Edge Devices

This paper proposes an extremely lightweight phone-based transducer mode...
research
12/24/2019

Singular Value Decomposition in Sobolev Spaces: Part II

Under certain conditions, an element of a tensor product space can be id...
research
12/02/2020

Combining Spatial Clustering with LSTM Speech Models for Multichannel Speech Enhancement

Recurrent neural networks using the LSTM architecture can achieve signif...
research
06/30/2022

Language model compression with weighted low-rank factorization

Factorizing a large matrix into small matrices is a popular strategy for...
research
09/01/2017

Look-Ahead in the Two-Sided Reduction to Compact Band Forms for Symmetric Eigenvalue Problems and the SVD

We address the reduction to compact band forms, via unitary similarity t...

Please sign up or login with your details

Forgot password? Click here to reset