In this paper, we introduce the Kaizen framework that uses a continuousl...
In this work, to measure the accuracy and efficiency for a latency-contr...
End-to-end automatic speech recognition (ASR) models with a single neura...
In this work, we first show that on the widely used LibriSpeech benchmar...
Many semi- and weakly-supervised approaches have been investigated for
o...
Videos uploaded on social media are often accompanied with textual
descr...
Self-supervised learning has advanced rapidly, with several results beat...
Supervised ASR models have reached unprecedented levels of accuracy, tha...
Deep acoustic models typically receive features in the first layer of th...
We propose and evaluate transformer-based acoustic models (AMs) for hybr...
There is an implicit assumption that traditional hybrid approaches for
a...
Towards developing high-performing ASR for low-resource languages, appro...
End-to-end learning of recurrent neural networks (RNNs) is an attractive...
In this paper we propose a neural conversation model for conducting
dial...
This paper presents a model for end-to-end learning of task-oriented dia...
In a conversation or a dialogue process, attention and intention play
in...
Two recent approaches have achieved state-of-the-art results in image
ca...
This paper presents a novel approach for automatically generating image
...