Stochastic Restoration of Heavily Compressed Musical Audio using Generative Adversarial Networks

07/04/2022
by   Stefan Lattner, et al.
0

Lossy audio codecs compress (and decompress) digital audio streams by removing information that tends to be inaudible in human perception. Under high compression rates, such codecs may introduce a variety of impairments in the audio signal. Many works have tackled the problem of audio enhancement and compression artifact removal using deep learning techniques. However, only a few works tackle the restoration of heavily compressed audio signals in the musical domain. In such a scenario, there is no unique solution for the restoration of the original signal. Therefore, in this study, we test a stochastic generator of a Generative Adversarial Network (GAN) architecture for this task. Such a stochastic generator, conditioned on highly compressed musical audio signals, could one day generate outputs indistinguishable from high-quality releases. Therefore, the present study may yield insights into more efficient musical data storage and transmission. We train stochastic and deterministic generators on MP3-compressed audio signals with 16, 32, and 64 kbit/s. We perform an extensive evaluation of the different experiments utilizing objective metrics and listening tests. We find that the models can improve the quality of the audio signals over the MP3 versions for 16 and 32 kbit/s and that the stochastic generators are capable of generating outputs that are closer to the original signals than those of the deterministic generators.

READ FULL TEXT

page 3

page 5

page 8

page 9

page 11

page 14

page 15

page 16

research
08/03/2021

A Benchmarking Initiative for Audio-Domain Music Generation Using the Freesound Loop Dataset

This paper proposes a new benchmark task for generat-ing musical passage...
research
03/30/2022

Forensic Analysis and Localization of Multiply Compressed MP3 Audio Using Transformers

Audio signals are often stored and transmitted in compressed formats. Am...
research
09/25/2019

High Fidelity Speech Synthesis with Adversarial Networks

Generative adversarial networks have seen rapid development in recent ye...
research
05/11/2020

GACELA – A generative adversarial context encoder for long audio inpainting

We introduce GACELA, a generative adversarial network (GAN) designed to ...
research
09/06/2021

Machine Learning: Challenges, Limitations, and Compatibility for Audio Restoration Processes

In this paper machine learning networks are explored for their use in re...
research
12/30/2022

Blind Restoration of Real-World Audio by 1D Operational GANs

Objective: Despite numerous studies proposed for audio restoration in th...
research
12/11/2019

Learning to Model Aspects of Hearing Perception Using Neural Loss Functions

We present a framework to model the perceived quality of audio signals b...

Please sign up or login with your details

Forgot password? Click here to reset