Fast offline Transformer-based end-to-end automatic speech recognition for real-world applications

01/14/2021
by   Yoo Rhee Oh, et al.

Many real-world applications require converting speech files into text with high accuracy under limited resources. This paper proposes a method for quickly recognizing a large speech database using a Transformer-based end-to-end model. Transformers have improved the state of the art in many fields, including speech recognition, but they are not easy to use for long sequences. In this paper, various techniques to speed up the recognition of real-world speech are proposed and tested, including parallelizing recognition using batched beam search, detecting end of speech based on connectionist temporal classification (CTC), restricting the CTC prefix score, and splitting long speech into short segments. Experiments are conducted on a real-world Korean speech recognition task. Experimental results with an 8-hour test corpus show that the proposed system can convert speech into text in less than 3 minutes with 10.73 conventional DNN-HMM based recognition system.
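As an illustration of the last technique listed, splitting long speech into short segments, the following is a minimal sketch and not the authors' method: the abstract does not say how segment boundaries are chosen. It assumes per-frame energies are already available and places each cut at the quietest frame near the end of the allowed window, so that cuts tend to fall in pauses. The function name `split_long_utterance` and the parameters `max_frames` and `search_frames` are hypothetical.

```python
from typing import List, Tuple


def split_long_utterance(
    frame_energy: List[float], max_frames: int, search_frames: int
) -> List[Tuple[int, int]]:
    """Return (start, end) frame ranges, each at most max_frames long.

    Hypothetical helper: each cut is placed at the lowest-energy frame
    among the last `search_frames` candidate positions of the current
    window. `end` is exclusive.
    """
    if max_frames <= 0 or not 0 < search_frames <= max_frames:
        raise ValueError("need 0 < search_frames <= max_frames")

    segments: List[Tuple[int, int]] = []
    n = len(frame_energy)
    start = 0
    while n - start > max_frames:
        # Candidate cut positions; a cut is the first frame of the next segment.
        first = start + max_frames - search_frames + 1
        last = start + max_frames
        cut = min(range(first, last + 1), key=lambda i: frame_energy[i])
        segments.append((start, cut))
        start = cut
    if start < n:
        segments.append((start, n))  # remaining tail, already short enough
    return segments


# Toy input: 20 frames of constant energy with two silent frames.
energy = [1.0] * 20
energy[7] = 0.0
energy[15] = 0.0
segments = split_long_utterance(energy, max_frames=10, search_frames=5)
print(segments)  # [(0, 7), (7, 15), (15, 20)]
```

Bounding the segment length keeps each input short, which matters for Transformers because self-attention cost grows quadratically with sequence length. Segments of similar length are also easier to group for batched decoding.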


