Detecting Synthetic Speech Manipulation in Real Audio Recordings

09/15/2022
by   Md Hafizur Rahman, et al.
0

Recent advances in artificial speech and audio technologies have improved the abilities of deep-fake operators to falsify media and spread malicious misinformation. Anyone with limited coding skills can use freely available speech synthesis tools to create convincing simulations of influential speakers' voices with the malicious intent to distort the original message. With the latest technology, malicious operators do not have to generate an entire audio clip; instead, they can insert a partial manipulation or a segment of synthetic speech into a genuine audio recording to change the entire context and meaning of the original message. Detecting these insertions is especially challenging because partially manipulated audio can more easily avoid synthetic speech detectors than entirely fake messages can. This paper describes a potential partial synthetic speech detection system based on the x-ResNet architecture with a probabilistic linear discriminant analysis (PLDA) backend and interleaved aware score processing. Experimental results suggest that the PLDA backend results in a 25 synthesized datasets over a non-PLDA baseline.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/08/2021

Half-Truth: A Partially Fake Audio Detection Dataset

Diverse promising datasets have been designed to hold back the developme...
research
10/18/2021

FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech Detection

As increasing development of text-to-speech (TTS) and voice conversion (...
research
10/23/2020

A Cross-Verification Approach for Protecting World Leaders from Fake and Tampered Audio

This paper tackles the problem of verifying the authenticity of speech r...
research
05/22/2023

Towards generalizing deep-audio fake detection networks

Today's generative neural networks allow the creation of high-quality sy...
research
09/15/2023

Syn-Att: Synthetic Speech Attribution via Semi-Supervised Unknown Multi-Class Ensemble of CNNs

With the huge technological advances introduced by deep learning in audi...
research
07/28/2023

All-for-One and One-For-All: Deep learning-based feature fusion for Synthetic Speech Detection

Recent advances in deep learning and computer vision have made the synth...
research
10/17/2021

Storage and Authentication of Audio Footage for IoAuT Devices Using Distributed Ledger Technology

Detection of fabricated or manipulated audio content to prevent, e.g., d...

Please sign up or login with your details

Forgot password? Click here to reset