Roland Maas

research

∙ 10/22/2022

Guided contrastive self-supervised pre-training for automatic speech recognition

Contrastive Predictive Coding (CPC) is a representation learning method ...

0 Aparna Khare, et al. ∙

research

∙ 07/16/2022

Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation

We present an approach to reduce the performance disparity between geogr...

0 Viet Anh Trinh, et al. ∙

research

∙ 02/22/2022

VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition

While end-to-end models have shown great success on the Automatic Speech...

0 Jinhan Wang, et al. ∙

research

∙ 06/14/2021

SynthASR: Unlocking Synthetic Data for Speech Recognition

End-to-end (E2E) automatic speech recognition (ASR) models have recently...

0 Amin Fazel, et al. ∙

research

∙ 03/09/2021

Wav2vec-C: A Self-supervised Model for Speech Representation Learning

Wav2vec-C introduces a novel representation learning technique combining...

0 Samik Sadhu, et al. ∙

research

∙ 12/14/2020

REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling

Accents mismatching is a critical problem for end-to-end ASR. This paper...

0 Hu Hu, et al. ∙

research

∙ 07/27/2020

Efficient minimum word error rate training of RNN-Transducer for end-to-end speech recognition

In this work, we propose a novel and efficient minimum word error rate (...

0 Jinxi Guo, et al. ∙

research

∙ 07/17/2020

Streaming ResLSTM with Causal Mean Aggregation for Device-Directed Utterance Detection

In this paper, we propose a streaming model to distinguish voice queries...

0 Xiaosu Tong, et al. ∙

research

∙ 07/08/2020

Streaming End-to-End Bilingual ASR Systems with Joint Language Identification

Multilingual ASR technology simplifies model training and deployment, bu...

0 Surabhi Punjabi, et al. ∙

research

∙ 06/30/2020

Multi-view Frequency LSTM: An Efficient Frontend for Automatic Speech Recognition

Acoustic models in real-time speech recognition systems typically stack ...

0 Maarten Van Segbroeck, et al. ∙

research

∙ 06/01/2020

Streaming Language Identification using Combination of Acoustic Representations and ASR Hypotheses

This paper presents our modeling and architecture approaches for buildin...

0 Chander Chandak, et al. ∙

research

∙ 09/30/2019

DiPCo – Dinner Party Corpus

We present a speech data corpus that simulates a "dinner party" scenario...

0 Maarten Van Segbroeck, et al. ∙

research

∙ 01/05/2019

Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning

For real-world speech recognition applications, noise robustness is stil...

6 Ladislav Mošner, et al. ∙

research

∙ 09/20/2018

LSTM-based Whisper Detection

This article presents a whisper speech detector in the far-field domain....

0 Zeynab Raeesy, et al. ∙

research

∙ 08/07/2018

Device-directed Utterance Detection

In this work, we propose a classifier for distinguishing device-directed...

0 Sri Harish Mallidi, et al. ∙

research

∙ 04/14/2016

Estimating parameters of nonlinear systems using the elitist particle filter based on evolutionary strategies

In this article, we present the elitist particle filter based on evoluti...

0 Christian Huemmer, et al. ∙

research

∙ 11/18/2014

The NLMS algorithm with time-variant optimum stepsize derived from a Bayesian network perspective

In this article, we derive a new stepsize adaptation for the normalized ...

0 Christian Huemmer, et al. ∙

research

∙ 10/09/2014

Spatial Diffuseness Features for DNN-Based Speech Recognition in Noisy and Reverberant Environments

We propose a spatial diffuseness feature for deep neural network (DNN)-b...

0 Andreas Schwarz, et al. ∙

research

∙ 10/11/2013

A Bayesian Network View on Acoustic Model-Based Techniques for Robust Speech Recognition

This article provides a unifying Bayesian network view on various approa...

0 Roland Maas, et al. ∙

Roland Maas

Featured Co-authors

Sign in with Google

Consider DeepAI Pro