Convolutional frontends are a typical choice for Transformer-based autom...
Automatic speech recognition (ASR) in the cloud allows the use of larger...
Subwords are the most widely used output units in end-to-end speech reco...
We present an unsupervised training approach for a neural network-based ...
This paper introduces a new open source platform for end-to-end speech p...