Towards generalizing deep-audio fake detection networks

05/22/2023
by   Konstantin Gasenzer, et al.
0

Today's generative neural networks allow the creation of high-quality synthetic speech at scale. While we welcome the creative use of this new technology, we must also recognize the risks. As synthetic speech is abused for both monetary and identity theft, we require a broad set of deep fake identification tools. Furthermore, previous work reported a limited ability of deep classifiers to generalize to unseen audio generators. By leveraging the wavelet-packet and short-time Fourier transform, we train excellent lightweight detectors that generalize. We report improved results on an extension of the WaveFake dataset. To account for the rapid progress in the field, we additionally consider samples drawn from the novel Avocodo and BigVGAN networks.

READ FULL TEXT
research
04/08/2021

Half-Truth: A Partially Fake Audio Detection Dataset

Diverse promising datasets have been designed to hold back the developme...
research
09/15/2022

Detecting Synthetic Speech Manipulation in Real Audio Recordings

Recent advances in artificial speech and audio technologies have improve...
research
01/08/2023

Deepfake CAPTCHA: A Method for Preventing Fake Calls

Deep learning technology has made it possible to generate realistic cont...
research
03/29/2019

Training a Neural Speech Waveform Model using Spectral Losses of Short-Time Fourier Transform and Continuous Wavelet Transform

Recently, we proposed short-time Fourier transform (STFT)-based loss fun...
research
02/25/2023

Why Do Deepfake Detectors Fail?

Recent rapid advancements in deepfake technology have allowed the creati...
research
09/16/2022

TIMIT-TTS: a Text-to-Speech Dataset for Multimodal Synthetic Media Detection

With the rapid development of deep learning techniques, the generation a...
research
09/11/2023

Towards generalisable and calibrated synthetic speech detection with self-supervised representations

Generalisation – the ability of a model to perform well on unseen data –...

Please sign up or login with your details

Forgot password? Click here to reset