HUI-Audio-Corpus-German: A high quality TTS dataset

06/11/2021
by   Pascal Puchtler, et al.
0

The increasing availability of audio data on the internet lead to a multitude of datasets for development and training of text to speech applications, based on neural networks. Highly differing quality of voice, low sampling rates, lack of text normalization and disadvantageous alignment of audio samples to corresponding transcript sentences still limit the performance of deep neural networks trained on this task. Additionally, data resources in languages like German are still very limited. We introduce the "HUI-Audio-Corpus-German", a large, open-source dataset for TTS engines, created with a processing pipeline, which produces high quality audio to transcription alignments and decreases manual effort needed for creation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/17/2019

LibriVoxDeEn: A Corpus for German-to-English Speech Translation and Speech Recognition

We present a corpus of sentence-aligned triples of German audio, German ...
research
10/06/2020

Swiss Parliaments Corpus, an Automatically Aligned Swiss German Speech to Standard German Text Corpus

We present a forced sentence alignment procedure for Swiss German speech...
research
04/11/2021

NeMo Toolbox for Speech Dataset Construction

In this paper, we introduce a new toolbox for constructing speech datase...
research
03/12/2022

A Proposal to Study "Is High Quality Data All We Need?"

Even though deep neural models have achieved superhuman performance on m...
research
05/24/2022

Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video Podcasts

We introduce the Merkel Podcast Corpus, an audio-visual-text corpus in G...
research
06/28/2022

On the Impact of Noises in Crowd-Sourced Data for Speech Translation

Training speech translation (ST) models requires large and high-quality ...
research
06/25/2021

Basis-MelGAN: Efficient Neural Vocoder Based on Audio Decomposition

Recent studies have shown that neural vocoders based on generative adver...

Please sign up or login with your details

Forgot password? Click here to reset