Emotional Voice Conversion: Theory, Databases and ESD

05/31/2021
by   Kun Zhou, et al.

In this paper, we first provide a review of the state-of-the-art emotional voice conversion research, and the existing emotional speech databases. We then motivate the development of a novel emotional speech database (ESD) that addresses the increasing research need. With this paper, the ESD database is now made available to the research community. The ESD database consists of 350 parallel utterances spoken by 10 native English and 10 native Chinese speakers and covers 5 emotion categories (neutral, happy, angry, sad and surprise). More than 29 hours of speech data were recorded in a controlled acoustic environment. The database is suitable for multi-speaker and cross-lingual emotional voice conversion studies. As case studies, we implement several state-of-the-art emotional voice conversion systems on the ESD database. This paper provides a reference study on ESD in conjunction with its release.

research
07/21/2021

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

We present an unsupervised non-parallel many-to-many voice conversion (V...
research
06/25/2018

The Emotional Voices Database: Towards Controlling the Emotion Dimension in Voice Generation Systems

In this paper, we present a database of emotional speech intended to be ...
research
06/04/2019

ShEMO -- A Large-Scale Validated Database for Persian Speech Emotion Detection

This paper introduces a large-scale, validated database for Persian call...
research
12/01/2020

NHSS: A Speech and Singing Parallel Database

We present a database of parallel recordings of speech and singing, coll...
research
12/07/2021

From Assistants to Friends: Investigating Emotional Intelligence of IPAs in Hindi and English

Intelligent Personal Assistants (IPAs) like Amazon Alexa, Apple Siri, an...
research
12/29/2019

A Comparative Study of Pitch Extraction Algorithms on a Large Variety of Singing Sounds

The problem of pitch tracking has been extensively studied in the speech...
research
01/09/2021

Spanish expressive voices: Corpus for emotion research in spanish

A new emotional multimedia database has been recorded and aligned. The d...
