The Sound of Silence: Efficiency of First Digit Features in Synthetic Audio Detection

10/06/2022
by   Daniele Mari, et al.
0

The recent integration of generative neural strategies and audio processing techniques have fostered the widespread of synthetic speech synthesis or transformation algorithms. This capability proves to be harmful in many legal and informative processes (news, biometric authentication, audio evidence in courts, etc.). Thus, the development of efficient detection algorithms is both crucial and challenging due to the heterogeneity of forgery techniques. This work investigates the discriminative role of silenced parts in synthetic speech detection and shows how first digit statistics extracted from MFCC coefficients can efficiently enable a robust detection. The proposed procedure is computationally-lightweight and effective on many different algorithms since it does not rely on large neural detection architecture and obtains an accuracy above 90% in most of the classes of the ASVSpoof dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/18/2021

FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech Detection

As increasing development of text-to-speech (TTS) and voice conversion (...
research
08/21/2022

System Fingerprints Detection for DeepFake Audio: An Initial Dataset and Investigation

Many effective attempts have been made for deepfake audio detection. How...
research
08/02/2022

Audio Deepfake Detection Based on a Combination of F0 Information and Real Plus Imaginary Spectrogram Features

Recently, pioneer research works have proposed a large number of acousti...
research
07/28/2023

All-for-One and One-For-All: Deep learning-based feature fusion for Synthetic Speech Detection

Recent advances in deep learning and computer vision have made the synth...
research
07/29/2022

Towards Unconstrained Audio Splicing Detection and Localization with Neural Networks

Freely available and easy-to-use audio editing tools make it straightfor...
research
03/07/2022

Detection of AI Synthesized Hindi Speech

The recent advancements in generative artificial speech models have made...
research
03/30/2022

Does Audio Deepfake Detection Generalize?

Current text-to-speech algorithms produce realistic fakes of human voice...

Please sign up or login with your details

Forgot password? Click here to reset