In this paper, we propose a fully supervised speaker diarization approac...
In this work, we perform an empirical comparison among the CTC, RNN-Tran...
We propose a probabilistic framework for domain adaptation that blends b...
Replacing hand-engineered pipelines with end-to-end deep learning system...
We present Deep Speaker, a neural speaker embedding system that maps utt...
Most existing sequence labelling models rely on a fixed decomposition of...
Deep learning has dramatically improved the performance of speech recogn...
We show that an end-to-end deep learning approach can be used to recogni...
In this paper, we propose multi-stage and deformable deep convolutional ...
Various factors, such as identities, views (poses), and illuminations, a...