Universal audio synthesizer control with normalizing flows

07/01/2019
by   Philippe Esling, et al.
3

The ubiquity of sound synthesizers has reshaped music production and even entirely defined new music genres. However, the increasing complexity and number of parameters in modern synthesizers make them harder to master. Hence, the development of methods allowing to easily create and explore with synthesizers is a crucial need. Here, we introduce a novel formulation of audio synthesizer control. We formalize it as finding an organized latent audio space that represents the capabilities of a synthesizer, while constructing an invertible mapping to the space of its parameters. By using this formulation, we show that we can address simultaneously automatic parameter inference, macro-control learning and audio-based preset exploration within a single model. To solve this new formulation, we rely on Variational Auto-Encoders (VAE) and Normalizing Flows (NF) to organize and map the respective auditory and parameter spaces. We introduce the disentangling flows, which allow to perform the invertible mapping between separate latent spaces, while steering the organization of some latent dimensions to match target variation factors by splitting the objective as partial density evaluation. We evaluate our proposal against a large set of baseline models and show its superiority in both parameter inference and audio reconstruction. We also show that the model disentangles the major factors of audio variations as latent dimensions, that can be directly used as macro-parameters. We also show that our model is able to learn semantic controls of a synthesizer by smoothly mapping to its parameters. Finally, we discuss the use of our model in creative applications and its real-time implementation in Ableton Live

READ FULL TEXT

page 4

page 6

page 8

page 9

research
05/24/2023

Sound Design Strategies for Latent Audio Space Explorations Using Deep Learning Architectures

The research in Deep Learning applications in sound and music computing ...
research
05/22/2018

Generative timbre spaces: regularizing variational auto-encoders with perceptual metrics

Timbre spaces have been used in music perception to study the perceptual...
research
08/04/2020

Timbre latent space: exploration and creative aspects

Recent studies show the ability of unsupervised models to learn invertib...
research
05/22/2018

Generative timbre spaces with variational audio synthesis

Timbre spaces have been used in music perception to study the relationsh...
research
06/18/2021

An Audio-Driven System For Real-Time Music Visualisation

Computer-generated visualisations can accompany recorded or live music t...
research
05/01/2019

A Feature Learning Siamese Model for Intelligent Control of the Dynamic Range Compressor

In this paper, a siamese DNN model is proposed to learn the characterist...
research
10/27/2022

Synthesizer Preset Interpolation using Transformer Auto-Encoders

Sound synthesizers are widespread in modern music production but they in...

Please sign up or login with your details

Forgot password? Click here to reset