EmoFake: An Initial Dataset for Emotion Fake Audio Detection

11/10/2022
by Yan Zhao, et al.

Several datasets already exist for fake audio detection, such as the ASVspoof and ADD datasets. However, these datasets do not consider the situation in which the emotion of the audio has been changed from one to another while other information (e.g. speaker identity and content) remains the same. Changing the emotion of an utterance often changes its meaning, which may pose a serious threat to social stability. This paper therefore reports our progress in developing an emotion fake audio detection dataset, named EmoFake, in which the emotional state of the original audio has been changed. The fake audio in EmoFake is generated using state-of-the-art emotion voice conversion models. Benchmark experiments conducted on this dataset show that it poses a challenge to the LCNN and RawNet2 baseline models of ASVspoof 2021.

research
04/08/2021

Half-Truth: A Partially Fake Audio Detection Dataset

Diverse promising datasets have been designed to hold back the developme...
research
08/20/2022

An Initial Investigation for Detecting Vocoder Fingerprints of Fake Audio

Many effective attempts have been made for fake audio detection. However...
research
09/05/2023

FSD: An Initial Chinese Dataset for Fake Song Detection

Singing voice synthesis and singing voice conversion have significantly ...
research
11/11/2022

SceneFake: An Initial Dataset and Benchmarks for Scene Fake Audio Detection

Previous databases have been designed to further the development of fake...
research
09/14/2023

StarGAN-VC++: Towards Emotion Preserving Voice Conversion Using Deep Embeddings

Voice conversion (VC) transforms an utterance to sound like another pers...
research
06/27/2022

Attack Agnostic Dataset: Towards Generalization and Stabilization of Audio DeepFake Detection

Audio DeepFakes allow the creation of high-quality, convincing utterance...
research
05/28/2020

DeepSonar: Towards Effective and Robust Detection of AI-Synthesized Fake Voices

With the recent advances in voice synthesis, AI-synthesized fake voices ...
