Sprachsynthese – State-of-the-Art in englischer und deutscher Sprache

06/11/2021
by   René Peinl, et al.
0

Reading text aloud is an important feature for modern computer applications. It not only facilitates access to information for visually impaired people, but is also a pleasant convenience for non-impaired users. In this article, the state of the art of speech synthesis is presented separately for mel-spectrogram generation and vocoders. It concludes with an overview of available data sets for English and German with a discussion of the transferability of the good speech synthesis results from English to German language.

READ FULL TEXT
research
05/31/2023

Text-to-Speech Pipeline for Swiss German – A comparison

In this work, we studied the synthesis of Swiss German speech using diff...
research
04/22/2022

LibriS2S: A German-English Speech-to-Speech Translation Corpus

Recently, we have seen an increasing interest in the area of speech-to-t...
research
01/17/2023

2nd Swiss German Speech to Standard German Text Shared Task at SwissText 2022

We present the results and findings of the 2nd Swiss German speech to St...
research
10/15/2021

Scribosermo: Fast Speech-to-Text models for German and other Languages

Recent Speech-to-Text models often require a large amount of hardware re...
research
05/18/2023

a unified front-end framework for english text-to-speech synthesis

The front-end is a critical component of English text-to-speech (TTS) sy...
research
09/27/2022

Direct Speech Translation for Automatic Subtitling

Automatic subtitling is the task of automatically translating the speech...

Please sign up or login with your details

Forgot password? Click here to reset