CycleGAN-Based Unpaired Speech Dereverberation

03/29/2022
by   Hannah Muckenhirn, et al.
0

Typically, neural network-based speech dereverberation models are trained on paired data, composed of a dry utterance and its corresponding reverberant utterance. The main limitation of this approach is that such models can only be trained on large amounts of data and a variety of room impulse responses when the data is synthetically reverberated, since acquiring real paired data is costly. In this paper we propose a CycleGAN-based approach that enables dereverberation models to be trained on unpaired data. We quantify the impact of using unpaired data by comparing the proposed unpaired model to a paired model with the same architecture and trained on the paired version of the same dataset. We show that the performance of the unpaired model is comparable to the performance of the paired model on two different datasets, according to objective evaluation metrics. Furthermore, we run two subjective evaluations and show that both models achieve comparable subjective quality on the AMI dataset, which was not seen during training.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/23/2021

Guided-TTS:Text-to-Speech with Untranscribed Speech

Most neural text-to-speech (TTS) models require <speech, transcript> pai...
research
10/27/2022

Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech

This paper proposes Virtuoso, a massively multilingual speech-text joint...
research
07/05/2018

Adaptive Paired-Comparison Method for Subjective Video Quality Assessment on Mobile Devices

To effectively evaluate subjective visual quality in weakly-controlled e...
research
10/28/2020

Decoupling Pronunciation and Language for End-to-end Code-switching Automatic Speech Recognition

Despite the recent significant advances witnessed in end-to-end (E2E) AS...
research
02/02/2022

Keyword localisation in untranscribed speech using visually grounded speech models

Keyword localisation is the task of finding where in a speech utterance ...
research
12/10/2022

Synthetic Wave-Geometric Impulse Responses for Improved Speech Dereverberation

We present a novel approach to improve the performance of learning-based...
research
11/02/2022

Adversarial Guitar Amplifier Modelling With Unpaired Data

We propose an audio effects processing framework that learns to emulate ...

Please sign up or login with your details

Forgot password? Click here to reset