Manner of Articulation Detection using Connectionist Temporal Classification to Improve Automatic Speech Recognition Performance

11/05/2018
by   Pradeep R, et al.

Conventionally, the manners of articulation in a speech signal are derived using discriminative signal processing techniques or deep learning approaches. However, training such complex systems involves feature extraction, forced phoneme alignment, and deep neural network training. In our work, we first detect the manner of articulation without phoneme alignment, using end-to-end manner of articulation modeling based on connectionist temporal classification (CTC). The manner of articulation knowledge is then applied to the conventional character CTC path to regenerate a new character CTC path. The modified manner-based character CTC is evaluated on open-source speech datasets (AN4, LibriSpeech, and TEDLIUM-2) and outperforms the baseline character CTC.
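For readers unfamiliar with the term, a "character CTC path" is the frame-level label sequence that a CTC model emits, which is reduced to text by merging repeated symbols and then dropping blanks. The sketch below illustrates only this generic collapse rule, not the manner-based modification proposed in the paper; the blank symbol `_` and the function name `collapse_ctc_path` are assumptions made for illustration.

```python
from itertools import groupby

BLANK = "_"  # assumed blank symbol, chosen here for illustration only


def collapse_ctc_path(path, blank=BLANK):
    """Collapse a frame-level CTC path into an output string.

    Standard CTC rule: merge consecutive repeats first, then remove blanks.
    """
    # groupby yields one key per run of identical consecutive symbols
    merged = [symbol for symbol, _ in groupby(path)]
    return "".join(symbol for symbol in merged if symbol != blank)


if __name__ == "__main__":
    # One symbol per acoustic frame; a blank between repeats keeps double letters
    print(collapse_ctc_path(list("__cc_aa__t_")))
    print(collapse_ctc_path(list("hh_e_ll_ll_oo")))
```

Because repeats are merged before blanks are removed, a blank between two identical symbols is what lets a path encode a genuine double letter.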

research
11/16/2018

Beam Search Decoding using Manner of Articulation Detection Knowledge Derived from Connectionist Temporal Classification

Manner of articulation detection using deep neural networks require a pr...
research
03/15/2022

End-to-end P300 BCI using Bayesian accumulation of Riemannian probabilities

In brain-computer interfaces (BCI), most of the approaches based on even...
research
05/10/2018

A comparable study of modeling units for end-to-end Mandarin speech recognition

End-To-End speech recognition have become increasingly popular in mandar...
research
12/22/2019

power-law nonlinearity with maximally uniform distribution criterion for improved neural network training in automatic speech recognition

In this paper, we describe the Maximum Uniformity of Distribution (MUD) ...
research
06/09/2021

A Comparative Study on Neural Architectures and Training Methods for Japanese Speech Recognition

End-to-end (E2E) modeling is advantageous for automatic speech recogniti...
research
02/18/2019

End-to-end Lyrics Alignment for Polyphonic Music Using an Audio-to-Character Recognition Model

Time-aligned lyrics can enrich the music listening experience by enablin...
research
10/20/2022

Speech Dereverberation with a Reverberation Time Shortening Target

This work proposes a new learning target based on reverberation time sho...
