Audio Defect Detection in Music with Deep Networks

02/11/2022
by Daniel Wolff, et al.

With increasing amounts of music being digitally transferred from production to distribution, automatic means of determining media quality are needed. Protection mechanisms in digital audio processing tools have not eliminated the need for production entities located downstream in the distribution chain to assess audio quality and detect defects inserted further upstream. Such analysis often relies on the received audio and scarce metadata alone. The deliberate use of artefacts such as clicks in popular music, as well as more recent defects stemming from corruption in modern audio encodings, calls for data-centric and context-sensitive solutions for detection. We present a convolutional network architecture following an end-to-end encoder-decoder configuration and use it to develop detectors for two exemplary audio defects. A click detector is trained and compared to a traditional signal processing method, with a discussion of context sensitivity. Additional post-processing is used for data augmentation and workflow simulation. The ability of our models to capture variance is explored in a detector for artefacts arising from the decompression of corrupted MP3-compressed audio. For both tasks we describe the synthetic generation of artefacts for controlled detector training and evaluation. We evaluate our detectors on the large open-source Free Music Archive (FMA) and on genre-specific datasets.
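
To make the idea of synthetic artefact generation concrete, the following is a minimal sketch of how click defects might be added to clean audio together with sample-level labels for detector training. It is not the authors' procedure: the function name insert_clicks, the rectangular click shape, and the amplitude, width and seed parameters are all assumptions chosen for illustration.

```python
import numpy as np

def insert_clicks(audio, n_clicks=5, amplitude=0.8, width=3, seed=0):
    """Hypothetical helper: add short impulsive clicks to a mono signal.

    Returns the corrupted signal and a 0/1 label per sample marking
    where a click was inserted.
    """
    rng = np.random.default_rng(seed)
    corrupted = audio.astype(np.float64).copy()
    labels = np.zeros(len(audio), dtype=np.int8)

    # Pick distinct, non-overlapping slots of `width` samples each.
    n_slots = len(audio) // width
    slots = rng.choice(n_slots, size=n_clicks, replace=False)

    for slot in slots:
        start = slot * width
        sign = rng.choice([-1.0, 1.0])  # random click polarity
        corrupted[start:start + width] += sign * amplitude
        labels[start:start + width] = 1

    # Keep the result inside the usual [-1, 1] floating-point audio range.
    return np.clip(corrupted, -1.0, 1.0), labels

# Example: corrupt one second of a 440 Hz tone sampled at 16 kHz.
sr = 16000
t = np.arange(sr) / sr
clean = 0.3 * np.sin(2 * np.pi * 440.0 * t)
noisy, labels = insert_clicks(clean, n_clicks=5)
```

Because the click positions are known exactly, pairs of corrupted audio and labels like these could serve as controlled training and evaluation data for a detector; realistic click shapes and levels would need to be modelled on the defects of interest.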

research
11/06/2021

Digital Audio Processing Tools for Music Corpus Studies

Digital audio processing tools offer music researchers the opportunity t...
research
11/14/2020

Towards transformation-resilient provenance detection of digital media

Advancements in deep generative models have made it possible to synthesi...
research
05/11/2021

Differentiable Signal Processing With Black-Box Audio Effects

We present a data-driven approach to automate audio signal processing by...
research
10/19/2021

Temporal separation of whale vocalizations from background oceanic noise using a power calculation

The process of analyzing audio signals in search of cetacean vocalizatio...
research
09/20/2023

Investigating Personalization Methods in Text to Music Generation

In this work, we investigate the personalization of text-to-music diffus...
research
06/10/2020

Exploring Quality and Generalizability in Parameterized Neural Audio Effects

Deep neural networks have shown promise for music audio signal processin...
research
12/15/2020

An Artistic Visualization of Music Modeling a Synesthetic Experience

This project brings music to sight. Music can be a visual masterpiece. S...
