Contrastive Predictive Coding (CPC) is a representation learning method ...
We present an approach to reduce the performance disparity between geogr...
While end-to-end models have shown great success on the Automatic Speech...
End-to-end (E2E) automatic speech recognition (ASR) models have recently...
Wav2vec-C introduces a novel representation learning technique combining...
Accent mismatch is a critical problem for end-to-end ASR. This paper...
In this work, we propose a novel and efficient minimum word error rate (...
In this paper, we propose a streaming model to distinguish voice queries...
Multilingual ASR technology simplifies model training and deployment, bu...
Acoustic models in real-time speech recognition systems typically stack ...
This paper presents our modeling and architecture approaches for buildin...
We present a speech data corpus that simulates a "dinner party" scenario...
For real-world speech recognition applications, noise robustness is stil...
This article presents a whisper speech detector in the far-field domain....
In this work, we propose a classifier for distinguishing device-directed...
In this article, we present the elitist particle filter based on evoluti...
In this article, we derive a new stepsize adaptation for the normalized ...
We propose a spatial diffuseness feature for deep neural network (DNN)-b...
This article provides a unifying Bayesian network view on various approa...