Tiny Transducer: A Highly-efficient Speech Recognition Model on Edge Devices

01/18/2021
by   Yuekai Zhang, et al.
0

This paper proposes an extremely lightweight phone-based transducer model with a tiny decoding graph on edge devices. First, a phone synchronous decoding (PSD) algorithm based on blank label skipping is first used to speed up the transducer decoding process. Then, to decrease the deletion errors introduced by the high blank score, a blank label deweighting approach is proposed. To reduce parameters and computation, deep feedforward sequential memory network (DFSMN) layers are used in the transducer encoder, and a CNN-based stateless predictor is adopted. SVD technology compresses the model further. WFST-based decoding graph takes the context-independent (CI) phone posteriors as input and allows us to flexibly bias user-specific information. Finally, with only 0.9M parameters after SVD, our system could give a relative 9.1 compared with a bigger conventional hybrid system on edge devices.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/24/2023

Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition

Although frame-based models, such as CTC and transducers, have an affini...
research
10/13/2021

Comparison of SVD and factorized TDNN approaches for speech to text

This work concentrates on reducing the RTF and word error rate of a hybr...
research
05/17/2023

Boosting Local Spectro-Temporal Features for Speech Analysis

We introduce the problem of phone classification in the context of speec...
research
04/13/2017

Mobile Keyboard Input Decoding with Finite-State Transducers

We propose a finite-state transducer (FST) representation for the models...
research
03/04/2018

Deep-FSMN for Large Vocabulary Continuous Speech Recognition

In this paper, we present an improved feedforward sequential memory netw...
research
07/01/2021

Interactive decoding of words from visual speech recognition models

This work describes an interactive decoding method to improve the perfor...
research
03/14/2022

Swap, Shift and Trim to Edge Collapse a Filtration

Boissonnat and Pritam introduced an algorithm to reduce a filtration of ...

Please sign up or login with your details

Forgot password? Click here to reset