JVS corpus: free Japanese multi-speaker voice corpus

08/17/2019
by   Shinnosuke Takamichi, et al.
0

Thanks to improvements in machine learning techniques, including deep learning, speech synthesis is becoming a machine learning task. To accelerate speech synthesis research, we are developing Japanese voice corpora reasonably accessible from not only academic institutions but also commercial companies. In 2017, we released the JSUT corpus, which contains 10 hours of reading-style speech uttered by a single speaker, for end-to-end text-to-speech synthesis. For more general use in speech synthesis research, e.g., voice conversion and multi-speaker modeling, in this paper, we construct the JVS corpus, which contains voice data of 100 speakers in three styles (normal, whisper, and falsetto). The corpus contains 30 hours of voice data including 22 hours of parallel normal voices. This paper describes how we designed the corpus and summarizes the specifications. The corpus is available at our project page.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/28/2017

JSUT corpus: free large-scale Japanese speech corpus for end-to-end speech synthesis

Thanks to improvements in machine learning techniques including deep lea...
research
01/26/2022

J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis

In this paper, we construct a Japanese audiobook speech corpus called "J...
research
11/19/2020

TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos

We present the Tongue and Lips corpus (TaL), a multi-speaker corpus of a...
research
10/05/2020

JSSS: free Japanese speech corpus for summarization and simplification

In this paper, we construct a new Japanese speech corpus for speech-base...
research
06/04/2020

PJS: phoneme-balanced Japanese singing voice corpus

This paper presents a free Japanese singing voice corpus that can be use...
research
01/20/2020

JVS-MuSiC: Japanese multispeaker singing-voice corpus

Thanks to developments in machine learning techniques, it has become pos...
research
05/11/2020

End-To-End Speech Synthesis Applied to Brazilian Portuguese

Voice synthesis systems are popular in different applications, such as p...

Please sign up or login with your details

Forgot password? Click here to reset