Blind Estimation of Audio Processing Graph

03/15/2023
by   Sungho Lee, et al.

Musicians and audio engineers sculpt and transform their sounds by connecting multiple processors, forming an audio processing graph. However, most deep-learning methods overlook this real-world practice and assume fixed graph settings. To bridge this gap, we develop a system that reconstructs the entire graph from a given reference audio. We first generate a realistic graph-reference pair dataset and train a simple blind estimation system composed of a convolutional reference encoder and a transformer-based graph decoder. We apply our model to singing voice effects and drum mixing estimation tasks. Evaluation results show that our method can reconstruct complex signal routings, including multi-band processing and sidechaining.
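To make the notion of an audio processing graph concrete, here is a minimal sketch of one possible representation: processors as nodes, signal connections as directed edges, and a topological sort to find a valid processing order. It is an illustration only and not the data format or model from the paper. The class `AudioGraph`, the processor labels, and the port names `main` and `sidechain` are all hypothetical.

```python
from dataclasses import dataclass, field
from graphlib import TopologicalSorter  # standard library, Python 3.9+


@dataclass
class AudioGraph:
    """Hypothetical container for an audio processing graph (illustration only)."""
    nodes: dict = field(default_factory=dict)  # node name -> processor type label
    edges: list = field(default_factory=list)  # (source, destination, input port)

    def add(self, name, kind):
        self.nodes[name] = kind

    def connect(self, src, dst, port="main"):
        # Both endpoints must already exist in the graph.
        if src not in self.nodes or dst not in self.nodes:
            raise KeyError(f"unknown node in edge {src} -> {dst}")
        self.edges.append((src, dst, port))

    def processing_order(self):
        # Map each node to the set of nodes it depends on, then sort so that
        # every processor runs after all of its inputs are available.
        deps = {name: set() for name in self.nodes}
        for src, dst, _ in self.edges:
            deps[dst].add(src)
        return list(TopologicalSorter(deps).static_order())


# A toy graph with multi-band processing and sidechaining (assumed example).
g = AudioGraph()
for name, kind in [("in", "input"), ("low", "lowpass"), ("high", "highpass"),
                   ("comp", "compressor"), ("mix", "mix"), ("out", "output")]:
    g.add(name, kind)

g.connect("in", "low")                      # split the input into two bands
g.connect("in", "high")
g.connect("low", "comp")                    # the low band is compressed...
g.connect("high", "comp", port="sidechain") # ...keyed by the high band
g.connect("comp", "mix")
g.connect("high", "mix")                    # the bands are summed back together
g.connect("mix", "out")

order = g.processing_order()
print(order)
```

In this framing, blind estimation would mean predicting a structure like `g`, including node types and connections, from the output audio alone.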


