PyKaldi2: Yet another speech toolkit based on Kaldi and PyTorch

07/12/2019
by Liang Lu, et al.

We introduce PyKaldi2, a speech recognition toolkit built on Kaldi and PyTorch. While similar toolkits built on top of the two are available, a key feature of PyKaldi2 is sequence training with criteria such as MMI, sMBR and MPE. In particular, we implemented the sequence training module with on-the-fly lattice generation during model training in order to simplify the training pipeline. To address the challenging acoustic environments in real applications, PyKaldi2 also supports on-the-fly noise and reverberation simulation to improve model robustness. With this feature, it is possible to backpropagate the gradients from the sequence-level loss to the front-end feature extraction module, which, hopefully, can foster more research in the direction of joint front-end and back-end learning. We performed benchmark experiments on Librispeech and show that PyKaldi2 can achieve reasonable recognition accuracy. The toolkit is released under the MIT license.
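
To make the idea of on-the-fly noise and reverberation simulation concrete, here is a minimal NumPy sketch. It is not PyKaldi2's implementation or API: the function name `simulate_noisy_reverberant`, its parameters, and the choice to measure SNR against the reverberant signal are all assumptions made for illustration. It convolves clean speech with a room impulse response and then adds noise scaled to a target signal-to-noise ratio.

```python
import numpy as np


def simulate_noisy_reverberant(speech, noise, rir, snr_db):
    """Hypothetical helper: add reverberation, then noise at `snr_db` dB.

    speech, noise, rir are 1-D float arrays at the same sampling rate.
    The SNR is defined here relative to the reverberant speech (an assumption).
    """
    speech = np.asarray(speech, dtype=np.float64)
    noise = np.asarray(noise, dtype=np.float64)
    rir = np.asarray(rir, dtype=np.float64)

    # Reverberation: convolve with the impulse response, keep original length.
    reverberant = np.convolve(speech, rir)[: len(speech)]

    # Repeat or crop the noise so it covers the whole utterance.
    reps = int(np.ceil(len(reverberant) / len(noise)))
    noise = np.tile(noise, reps)[: len(reverberant)]

    speech_power = np.mean(reverberant ** 2)
    noise_power = np.mean(noise ** 2)
    if noise_power == 0.0:
        raise ValueError("noise signal has zero power")

    # Choose the scale so that 10*log10(speech_power / scaled_noise_power) == snr_db.
    scale = np.sqrt(speech_power / (noise_power * 10.0 ** (snr_db / 10.0)))
    return reverberant + scale * noise
```

Because these operations are a convolution, a scaling and an addition, the same steps written with PyTorch tensors would be differentiable, which is the property the abstract relies on when it mentions passing gradients from the sequence-level loss back to the front end. Drawing a different noise clip, impulse response and SNR for each training utterance is what would make such a simulation "on the fly".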


