Deepfake audio detection by speaker verification

09/28/2022
by   Alessandro Pianese, et al.
0

Thanks to recent advances in deep learning, sophisticated generation tools exist, nowadays, that produce extremely realistic synthetic speech. However, malicious uses of such tools are possible and likely, posing a serious threat to our society. Hence, synthetic voice detection has become a pressing research topic, and a large variety of detection methods have been recently proposed. Unfortunately, they hardly generalize to synthetic audios generated by tools never seen in the training phase, which makes them unfit to face real-world scenarios. In this work, we aim at overcoming this issue by proposing a new detection approach that leverages only the biometric characteristics of the speaker, with no reference to specific manipulations. Since the detector is trained only on real data, generalization is automatically ensured. The proposed approach can be implemented based on off-the-shelf speaker verification tools. We test several such solutions on three popular test sets, obtaining good performance, high generalization ability, and high robustness to audio impairment.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/31/2022

Combining Automatic Speaker Verification and Prosody Analysis for Synthetic Speech Detection

The rapid spread of media content synthesis technology and the potential...
research
07/28/2023

All-for-One and One-For-All: Deep learning-based feature fusion for Synthetic Speech Detection

Recent advances in deep learning and computer vision have made the synth...
research
04/06/2022

Audio-Visual Person-of-Interest DeepFake Detection

Face manipulation technology is advancing very rapidly, and new methods ...
research
09/20/2021

"Hello, It's Me": Deep Learning-based Speech Synthesis Attacks in the Real World

Advances in deep learning have introduced a new wave of voice synthesis ...
research
02/11/2022

On the Detection of Adaptive Adversarial Attacks in Speaker Verification Systems

Speaker verification systems have been widely used in smart phones and I...
research
04/08/2020

Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification

Speaker verification systems usually suffer from the mismatch problem be...
research
06/27/2022

Attack Agnostic Dataset: Towards Generalization and Stabilization of Audio DeepFake Detection

Audio DeepFakes allow the creation of high-quality, convincing utterance...

Please sign up or login with your details

Forgot password? Click here to reset