Learning to Model Aspects of Hearing Perception Using Neural Loss Functions

12/11/2019
by   Prateek Verma, et al.
0

We present a framework to model the perceived quality of audio signals by combining convolutional architectures, with ideas from classical signal processing, and describe an approach to enhancing perceived acoustical quality. We demonstrate the approach by transforming the sound of an inexpensive musical with degraded sound quality to that of a high-quality musical instrument without the need for parallel data which is often hard to collect. We adapt the classical approach of a simple adaptive EQ filtering to the objective criterion learned by a neural architecture and optimize it to get the signal of our interest. Since we learn adaptive masks depending on the signal of interest as opposed to a fixed transformation for all the inputs, we show that shallow neural architectures can achieve the desired result. A simple constraint on the objective and the initialization helps us in avoiding adversarial examples, which otherwise would have produced noisy, unintelligible audio. We believe that the current framework proposed has enormous applications, in a variety of problems where one can learn a loss function depending on the problem, using a neural architecture and optimize it after it has been learned.

READ FULL TEXT

page 3

page 4

research
08/12/2022

DDX7: Differentiable FM Synthesis of Musical Instrument Sounds

FM Synthesis is a well-known algorithm used to generate complex timbre f...
research
09/14/2023

DDSP-SFX: Acoustically-guided sound effects generation with differentiable digital signal processing

Controlling the variations of sound effects using neural audio synthesis...
research
02/10/2020

Unsupervised Learning of Audio Perception for Robotics Applications: Learning to Project Data to T-SNE/UMAP space

Audio perception is a key to solving a variety of problems ranging from ...
research
12/31/2020

Psychoacoustic Calibration of Loss Functions for Efficient End-to-End Neural Audio Coding

Conventional audio coding technologies commonly leverage human perceptio...
research
11/22/2018

TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer

In this work, we address the problem of musical timbre transfer, where t...
research
07/04/2022

Stochastic Restoration of Heavily Compressed Musical Audio using Generative Adversarial Networks

Lossy audio codecs compress (and decompress) digital audio streams by re...
research
09/29/2021

Adaptive Approach For Sparse Representations Using The Locally Competitive Algorithm For Audio

Gammachirp filterbank has been used to approximate the cochlea in sparse...

Please sign up or login with your details

Forgot password? Click here to reset