DDS: A new device-degraded speech dataset for speech enhancement

09/16/2021
by   Haoyu Li, et al.
0

A large and growing amount of speech content in real-life scenarios is being recorded on common consumer devices in uncontrolled environments, resulting in degraded speech quality. Transforming such low-quality device-degraded speech into high-quality speech is a goal of speech enhancement (SE). This paper introduces a new speech dataset, DDS, to facilitate the research on SE. DDS provides aligned parallel recordings of high-quality speech (recorded in professional studios) and a number of versions of low-quality speech, producing approximately 2,000 hours speech data. The DDS dataset covers 27 realistic recording conditions by combining diverse acoustic environments and microphone devices, and each version of a condition consists of multiple recordings from six different microphone positions to simulate various signal-to-noise ratio (SNR) and reverberation levels. We also test several SE baseline systems on the DDS dataset and show the impact of recording diversity on performance.

READ FULL TEXT

page 2

page 4

research
11/10/2019

Transformation of low-quality device-recorded speech to high-quality speech using improved SEGAN model

Nowadays vast amounts of speech data are recorded from low-quality recor...
research
11/10/2020

Enhancing Low-Quality Voice Recordings Using Disentangled Channel Factor and Neural Waveform Model

High-quality speech corpora are essential foundations for most speech ap...
research
12/02/2022

Injecting Spatial Information for Monaural Speech Enhancement via Knowledge Distillation

Monaural speech enhancement (SE) provides a versatile and cost-effective...
research
10/29/2020

UNetGAN: A Robust Speech Enhancement Approach in Time Domain for Extremely Low Signal-to-noise Ratio Condition

Speech enhancement at extremely low signal-to-noise ratio (SNR) conditio...
research
06/21/2021

Speech prosody and remote experiments: a technical report

The aim of this paper is twofold. First, we present a review of differen...
research
08/02/2021

Robust Acoustic Scene Classification in the Presence of Active Foreground Speech

We present an iVector based Acoustic Scene Classification (ASC) system s...
research
04/04/2022

GWA: A Large High-Quality Acoustic Dataset for Audio Processing

We present the Geometric-Wave Acoustic (GWA) dataset, a large-scale audi...

Please sign up or login with your details

Forgot password? Click here to reset