Timbre latent space: exploration and creative aspects

08/04/2020
by Antoine Caillon, et al.

Recent studies show that unsupervised models can learn invertible audio representations using Auto-Encoders. These models enable high-quality sound synthesis but offer only limited control, since their latent spaces do not disentangle timbre properties. The emergence of disentangled representations has been studied in Variational Auto-Encoders (VAEs) and applied to audio. An additional perceptual regularization can align such latent representations with previously established multi-dimensional timbre spaces while still allowing continuous inference and synthesis. Alternatively, specific sound attributes can be learned as control variables while unsupervised dimensions account for the remaining features. Generative neural networks thus enable new possibilities for timbre manipulation, yet the exploration and creative use of their representations remain little studied. The following experiments were conducted in cooperation with two composers and propose new creative directions for exploring latent sound synthesis of musical timbres, using specifically designed interfaces (Max/MSP, Pure Data) or mappings for descriptor-based synthesis.

