Towards Unconstrained Audio Splicing Detection and Localization with Neural Networks

07/29/2022
by Denise Moussa, et al.

Freely available and easy-to-use audio editing tools make it straightforward to perform audio splicing. Convincing forgeries can be created by combining various speech samples from the same person. Detection of such splices is important both in the public sector when considering misinformation, and in a legal context to verify the integrity of evidence. Unfortunately, most existing detection algorithms for audio splicing use handcrafted features and make specific assumptions. However, criminal investigators are often faced with audio samples from unconstrained sources with unknown characteristics, which raises the need for more generally applicable methods. With this work, we aim to take a first step towards unconstrained audio splicing detection to address this need. We simulate various attack scenarios in the form of post-processing operations that may disguise splicing. We propose a Transformer sequence-to-sequence (seq2seq) network for splicing detection and localization. Our extensive evaluation shows that the proposed method outperforms existing dedicated approaches for splicing detection [3, 10] as well as the general-purpose networks EfficientNet [28] and RegNet [25].
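
To make the attack setting concrete, the following is a minimal sketch of how a spliced sample with a post-processing step could be simulated. It is not the authors' pipeline: the helper names (`sine`, `splice`, `add_noise`), the synthetic sine-wave segments, and the choice of additive white Gaussian noise as the disguising operation are all assumptions made for illustration. The recorded splice positions are the kind of ground truth a localization model would be trained to predict.

```python
import math
import random


def sine(freq_hz, n_samples, sample_rate=16000, amplitude=1.0):
    """Hypothetical stand-in for a speech segment: a plain sine tone."""
    return [
        amplitude * math.sin(2 * math.pi * freq_hz * i / sample_rate)
        for i in range(n_samples)
    ]


def splice(segments):
    """Concatenate segments and return (signal, splice positions in samples)."""
    signal, positions = [], []
    for index, segment in enumerate(segments):
        if index > 0:
            # A splice point sits where every segment after the first begins.
            positions.append(len(signal))
        signal.extend(segment)
    return signal, positions


def add_noise(signal, snr_db, rng):
    """Add white Gaussian noise at the requested signal-to-noise ratio (dB)."""
    signal_power = sum(x * x for x in signal) / len(signal)
    noise_std = math.sqrt(signal_power / 10 ** (snr_db / 10))
    return [x + rng.gauss(0.0, noise_std) for x in signal]


rng = random.Random(0)  # fixed seed so the sketch is reproducible
segments = [
    sine(220, 8000),
    sine(330, 4000, amplitude=0.5),
    sine(440, 4000),
]
spliced, positions = splice(segments)
disguised = add_noise(spliced, snr_db=20.0, rng=rng)
print(positions, len(disguised))
```

Other operations that alter the signal after splicing could be dropped in at the `add_noise` step in the same way; which operations the paper actually evaluates is described in the full text, not here.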

research
07/11/2023

Point to the Hidden: Exposing Speech Audio Splicing via Signal Pointer Nets

Verifying the integrity of voice recording evidence for criminal investi...
research
02/15/2023

Fast Blind Audio Copy-Move Detection and Localization Using Local Feature Tensors in Noise

The increasing availability of audio editing software altering digital a...
research
08/21/2022

System Fingerprints Detection for DeepFake Audio: An Initial Dataset and Investigation

Many effective attempts have been made for deepfake audio detection. How...
research
10/06/2022

The Sound of Silence: Efficiency of First Digit Features in Synthetic Audio Detection

The recent integration of generative neural strategies and audio process...
research
02/20/2019

Dual-modality seq2seq network for audio-visual event localization

Audio-visual event localization requires one to identify the event which ...
research
09/07/2023

Topological fingerprints for audio identification

We present a topological audio fingerprinting approach for robustly iden...
research
11/06/2022

Going In Style: Audio Backdoors Through Stylistic Transformations

A backdoor attack places triggers in victims' deep learning models to en...
