Joel Shor

research

∙ 03/10/2023

Clinical BERTScore: An Improved Measure of Automatic Speech Recognition Performance in Clinical Settings

Automatic Speech Recognition (ASR) in medical contexts has the potential...

0 Joel Shor, et al. ∙

research

∙ 11/17/2022

Knowledge distillation for fast and accurate DNA sequence correction

Accurate genome sequencing can improve our understanding of biology and ...

0 Anastasiya Belyaeva, et al. ∙

research

∙ 11/02/2022

The Need for Medically Aware Video Compression in Gastroenterology

Compression is essential to storing and transmitting medical videos, but...

0 Joel Shor, et al. ∙

research

∙ 03/01/2022

TRILLsson: Distilled Universal Paralinguistic Speech Representations

Recent advances in self-supervision have dramatically improved the quali...

0 Joel Shor, et al. ∙

research

∙ 10/09/2021

Universal Paralinguistic Speech Representations Using Self-Supervised Conformers

Many speech applications require understanding aspects beyond the words ...

0 Joel Shor, et al. ∙

research

∙ 09/27/2021

BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition

We summarize the results of a host of efforts using giant automatic spee...

1 Yu Zhang, et al. ∙

research

∙ 07/08/2021

Comparing Supervised Models And Learned Speech Representations For Classifying Intelligibility Of Disordered Speech On Selected Phrases

Automatic classification of disordered speech can provide an objective t...

0 Subhashini Venugopalan, et al. ∙

research

∙ 11/09/2020

FUN! Fast, Universal, Non-Semantic Speech Embeddings

Learned speech representations can drastically improve performance on ta...

0 Jacob Peplinski, et al. ∙

research

∙ 02/25/2020

Towards Learning a Universal Non-Semantic Representation of Speech

The ultimate goal of transfer learning is to reduce labeled data require...

0 Joel Shor, et al. ∙

research

∙ 07/31/2019

Personalizing ASR for Dysarthric and Accented Speech with Limited Data

Automatic speech recognition (ASR) systems have dramatically improved ov...

0 Joel Shor, et al. ∙

research

∙ 03/24/2018

Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron

We present an extension to the Tacotron speech synthesis architecture th...

0 RJ Skerry-Ryan, et al. ∙

research

∙ 03/23/2018

Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

In this work, we propose "global style tokens" (GSTs), a bank of embeddi...

0 Yuxuan Wang, et al. ∙

research

∙ 02/07/2018

Spatially adaptive image compression using a tiled deep network

Deep neural networks represent a powerful class of function approximator...

0 David Minnen, et al. ∙

research

∙ 11/01/2017

Uncovering Latent Style Factors for Expressive Speech Synthesis

Prosodic modeling is a core problem in speech synthesis. The key challen...

0 Yuxuan Wang, et al. ∙

research

∙ 05/18/2017

Target-Quality Image Compression with Recurrent, Convolutional Neural Networks

We introduce a stop-code tolerant (SCT) approach to training recurrent c...

0 Michele Covell, et al. ∙

research

∙ 03/29/2017

Improved Lossy Image Compression with Priming and Spatially Adaptive Bit Rates for Recurrent Networks

We propose a method for lossy image compression based on recurrent, conv...

0 Nick Johnston, et al. ∙

research

∙ 08/18/2016

Full Resolution Image Compression with Recurrent Neural Networks

This paper presents a set of full-resolution lossy image compression met...

0 George Toderici, et al. ∙

Joel Shor

Featured Co-authors

Sign in with Google

Consider DeepAI Pro