DeepAI AI Chat
Log In Sign Up

The Microsoft 2016 Conversational Speech Recognition System

by   W. Xiong, et al.

We describe Microsoft's conversational speech recognition system, in which we combine recent developments in neural-network-based acoustic and language modeling to advance the state of the art on the Switchboard recognition task. Inspired by machine learning ensemble techniques, the system uses a range of convolutional and recurrent neural networks. I-vector modeling and lattice-free MMI training provide significant gains for all acoustic model architectures. Language model rescoring with multiple forward and backward running RNNLMs, and word posterior-based system combination provide a 20 system uses a ResNet architecture acoustic model with RNNLM rescoring, and achieves a word error rate of 6.9 combined system has an error rate of 6.2 previously reported results on this benchmark task.


page 1

page 2

page 3

page 4


The Microsoft 2017 Conversational Speech Recognition System

We describe the 2017 version of Microsoft's conversational speech recogn...

Achieving Human Parity in Conversational Speech Recognition

Conversational speech recognition has served as a flagship speech recogn...

The IBM 2016 English Conversational Telephone Speech Recognition System

We describe a collection of acoustic and language modeling techniques th...

ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech Recognition

In this paper we present state-of-the-art (SOTA) performance on the Libr...

Signal Combination for Language Identification

Google's multilingual speech recognition system combines low-level acous...

Advancing RNN Transducer Technology for Speech Recognition

We investigate a set of techniques for RNN Transducers (RNN-Ts) that wer...

Acoustic Modeling Using a Shallow CNN-HTSVM Architecture

High-accuracy speech recognition is especially challenging when large da...

Code Repositories


Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.

view repo