
-
Transformer-Transducers for Code-Switched Speech Recognition
We live in a world where 60% of the population can speak two or more languages fluently. Members of these communi...
-
Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment
Non-autoregressive models greatly improve decoding speed over typical se...
-
Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech
In this work, we explore a multimodal semi-supervised learning approach ...
-
Robust Prediction of Punctuation and Truecasing for Medical ASR
Automatic speech recognition (ASR) systems in the medical domain that fo...
-
Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition
We propose a novel approach to semi-supervised automatic speech recognit...
-
Pseudolikelihood Reranking with Masked Language Models
We rerank with scores from pretrained masked language models like BERT t...
-
Contextual Phonetic Pretraining for End-to-end Utterance-level Language and Speaker Recognition
Pretrained contextual word representations in NLP have greatly improved ...
-
Simple, Fast, Accurate Intent Classification and Slot Labeling
In real-time dialogue systems running at scale, there is a tradeoff betw...
-
Multi-stream Network With Temporal Attention For Environmental Sound Classification
Environmental sound classification systems often do not perform robustly...
-
Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition
Self-attention has demonstrated great success in sequence-to-sequence ta...
-
Context Models for OOV Word Translation in Low-Resource Languages
Out-of-vocabulary word translation is a major problem for the translatio...
-
Syntactic and Semantic Features For Code-Switching Factored Language Models
This paper presents our latest investigations on different features for ...
-
Exploiting Out-of-Domain Data Sources for Dialectal Arabic Statistical Machine Translation
Statistical machine translation for dialectal Arabic is characterized by...