Automatic Speech Recognition (ASR) in medical contexts has the potential...
Accurate genome sequencing can improve our understanding of biology and ...
Compression is essential to storing and transmitting medical videos, but...
Recent advances in self-supervision have dramatically improved the quali...
Many speech applications require understanding aspects beyond the words ...
We summarize the results of a host of efforts using giant automatic spee...
Automatic classification of disordered speech can provide an objective t...
Learned speech representations can drastically improve performance on ta...
The ultimate goal of transfer learning is to reduce labeled data require...
Automatic speech recognition (ASR) systems have dramatically improved ov...
We present an extension to the Tacotron speech synthesis architecture th...
In this work, we propose "global style tokens" (GSTs), a bank of embeddi...
Deep neural networks represent a powerful class of function approximator...
Prosodic modeling is a core problem in speech synthesis. The key challen...
We introduce a stop-code tolerant (SCT) approach to training recurrent
c...
We propose a method for lossy image compression based on recurrent,
conv...
This paper presents a set of full-resolution lossy image compression met...