Forensic Analysis and Localization of Multiply Compressed MP3 Audio Using Transformers

03/30/2022
by Ziyue Xiang, et al.

Audio signals are often stored and transmitted in compressed formats. Among the many available audio compression schemes, MPEG-1 Audio Layer III (MP3) is one of the most widely used. Because MP3 is lossy, it leaves characteristic traces in the compressed audio that can be used forensically to expose the history of an audio file. In this paper, we consider the scenario of audio signal manipulation done by temporal splicing of compressed and uncompressed audio signals. We propose a method based on transformer networks to find the temporal location of the splices. Our method identifies which temporal portions of an audio signal have undergone single or multiple compression at the temporal frame level, which is the smallest temporal unit of MP3 compression. We tested our method on a dataset of 486,743 MP3 audio clips. Compared with existing methods, our method achieved higher performance and demonstrated robustness with respect to different MP3 data.
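
To make the frame-level output concrete, the following is a minimal sketch of how per-frame predictions could be turned into time segments whose boundaries are candidate splice locations. It is not the authors' implementation. The function name `label_segments`, the label encoding (0 = uncompressed, 1 = single compression, 2 = multiple compression), and the default sample rate are assumptions made for illustration. The only fixed fact used is that an MPEG-1 Layer III frame covers 1152 PCM samples. Decoder delay and frame-offset alignment are ignored.

```python
from itertools import groupby

# An MPEG-1 Audio Layer III frame covers 1152 PCM samples.
SAMPLES_PER_FRAME = 1152


def label_segments(frame_labels, sample_rate=44100):
    """Group consecutive frames with the same label into time segments.

    frame_labels: one integer per MP3 frame. The encoding is hypothetical:
        0 = uncompressed, 1 = single compression, 2 = multiple compression.
    Returns a list of (start_seconds, end_seconds, label) tuples. Each
    boundary between adjacent segments is a candidate splice location.
    """
    segments = []
    start_frame = 0
    for label, run in groupby(frame_labels):
        run_length = len(list(run))
        end_frame = start_frame + run_length
        start_s = start_frame * SAMPLES_PER_FRAME / sample_rate
        end_s = end_frame * SAMPLES_PER_FRAME / sample_rate
        segments.append((start_s, end_s, label))
        start_frame = end_frame
    return segments


if __name__ == "__main__":
    # Hypothetical per-frame predictions for a short clip.
    predicted = [1, 1, 1, 2, 2, 1]
    for start_s, end_s, label in label_segments(predicted):
        print(f"{start_s:.3f}s to {end_s:.3f}s: label {label}")
```

Because the labels are per frame, the time resolution of any splice estimate in this sketch is one frame, about 26 ms at 44.1 kHz.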


