Libri-Adapt: A New Speech Dataset for Unsupervised Domain Adaptation

09/06/2020
by   Akhil Mathur, et al.
0

This paper introduces a new dataset, Libri-Adapt, to support unsupervised domain adaptation research on speech recognition models. Built on top of the LibriSpeech corpus, Libri-Adapt contains English speech recorded on mobile and embedded-scale microphones, and spans 72 different domains that are representative of the challenging practical scenarios encountered by ASR models. More specifically, Libri-Adapt facilitates the study of domain shifts in ASR models caused by a) different acoustic environments, b) variations in speaker accents, c) heterogeneity in the hardware and platform software of the microphones, and d) a combination of the aforementioned three shifts. We also provide a number of baseline results quantifying the impact of these domain shifts on the Mozilla DeepSpeech2 ASR model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/21/2018

Adversarial Learning of Raw Speech Features for Domain Invariant Speech Recognition

Recent advances in neural network based acoustic modelling have shown si...
research
11/26/2020

Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-Training

The performance of automatic speech recognition (ASR) systems typically ...
research
06/22/2022

A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data

Automatic Speech Recognition(ASR) has been dominated by deep learning-ba...
research
06/03/2023

SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy Minimization

Automatic speech recognition (ASR) models are frequently exposed to data...
research
07/30/2018

Unsupervised Domain Adaptation by Adversarial Learning for Robust Speech Recognition

In this paper, we investigate the use of adversarial learning for unsupe...
research
09/20/2018

Can Deep Clinical Models Handle Real-World Domain Shifts?

The hypothesis that computational models can be reliable enough to be ad...
research
07/27/2021

Unsupervised Domain Adaptation for Hate Speech Detection Using a Data Augmentation Approach

Online harassment in the form of hate speech has been on the rise in rec...

Please sign up or login with your details

Forgot password? Click here to reset