A Feature Learning Siamese Model for Intelligent Control of the Dynamic Range Compressor

05/01/2019
by Di Sheng, et al.

In this paper, a siamese DNN model is proposed to learn the characteristics of the audio dynamic range compressor (DRC). This facilitates an intelligent control system that uses audio examples to configure the DRC, a non-linear audio signal conditioning technique widely used in music production, speech communication and broadcasting. Several alternative siamese DNN architectures are proposed to learn feature embeddings that can characterise the subtle effects of dynamic range compression. These models are compared with each other as well as with handcrafted features proposed in previous work. An evaluation of the relations between the DNN hyperparameters and the DRC parameters is also provided. The best model produces a universal feature embedding capable of predicting multiple DRC parameters simultaneously, which is a significant improvement over our previous research. The feature embedding outperforms handcrafted audio features when predicting DRC parameters for both mono-instrument audio loops and polyphonic music pieces.
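
The siamese idea in the abstract can be illustrated with a minimal sketch: two inputs pass through one branch with shared weights, and the resulting embeddings are combined into a single feature vector. Everything below is an assumption made for illustration and is not taken from the paper: the single dense layer, the random weights, the concatenation step, the names `make_weights`, `embed`, `siamese_features` and `compress`, and the toy sample-wise compressor (practical compressors usually work on signal level in dB with attack and release smoothing).

```python
import math
import random


def make_weights(n_in, n_out, seed=0):
    """Random weights for one dense layer (hypothetical, untrained)."""
    rng = random.Random(seed)
    scale = 1.0 / math.sqrt(n_in)
    return [[rng.uniform(-1.0, 1.0) * scale for _ in range(n_in)]
            for _ in range(n_out)]


def embed(frame, weights):
    """Shared branch: a single dense layer followed by tanh."""
    return [math.tanh(sum(w * x for w, x in zip(row, frame)))
            for row in weights]


def siamese_features(raw, processed, weights):
    """Run both inputs through the SAME weights and concatenate the embeddings."""
    return embed(raw, weights) + embed(processed, weights)


def compress(frame, threshold=0.5, ratio=4.0):
    """Toy static compressor: the part of each sample's magnitude
    above the threshold is divided by the ratio."""
    out = []
    for x in frame:
        mag = abs(x)
        if mag > threshold:
            mag = threshold + (mag - threshold) / ratio
        out.append(math.copysign(mag, x))
    return out


weights = make_weights(n_in=8, n_out=4)
raw = [math.sin(2 * math.pi * k / 8) for k in range(8)]  # one sine cycle
processed = compress(raw)                                 # compressed copy
features = siamese_features(raw, processed, weights)      # 4 + 4 values
```

In a trained system, a regression head would map a feature vector like `features` to DRC parameters such as threshold and ratio; the sketch stops before that step because the abstract does not describe it.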

research
10/29/2020

Learning Audio Embeddings with User Listening Data for Content-based Music Recommendation

Personalized recommendation on new track releases has always been a chal...
research
06/22/2023

Siamese SIREN: Audio Compression with Implicit Neural Representations

Implicit Neural Representations (INRs) have emerged as a promising metho...
research
05/28/2019

SignalTrain: Profiling Audio Compressors with Deep Neural Networks

In this work we present a data-driven approach for predicting the behavi...
research
12/17/2018

Instrument-Independent Dastgah Recognition of Iranian Classical Music Using AzarNet

In this paper, AzarNet, a deep neural network (DNN), is proposed to reco...
research
04/08/2019

Adversarial Audio: A New Information Hiding Method and Backdoor for DNN-based Speech Recognition Models

Audio is an important medium in people's daily life, hidden information ...
research
10/30/2017

Hit Song Prediction for Pop Music by Siamese CNN with Ranking Loss

A model for hit song prediction can be used in the pop music industry to...
research
07/01/2019

Universal audio synthesizer control with normalizing flows

The ubiquity of sound synthesizers has reshaped music production and eve...
