Neural Granular Sound Synthesis

08/04/2020
by   Adrien Bitton, et al.
0

Granular sound synthesis is a popular audio generation technique based on rearranging sequences of small waveform windows. In order to control the synthesis, all grains in a given corpus are analyzed through a set of acoustic descriptors. This provides a representation reflecting some form of local similarities across the grains. However, the quality of this grain space is bound by that of the descriptors. Its traversal is not continuously invertible to signal and does not render any structured temporality. We demonstrate that generative neural networks can implement granular synthesis while alleviating most of its shortcomings. We efficiently replace its audio descriptor basis by a probabilistic latent space learned with a Variational Auto-Encoder. A major advantage of our proposal is that the resulting grain space is invertible, meaning that we can continuously synthesize sound when traversing its dimensions. It also implies that original grains are not stored for synthesis. To learn structured paths inside this latent space, we add a higher-level temporal embedding trained on arranged grain sequences. The model can be applied to many types of libraries, including pitched notes or unpitched drums and environmental noises. We experiment with the common granular synthesis processes and enable new ones.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/13/2020

Vector-Quantized Timbre Representation

Timbre is a set of perceptual attributes that identifies different types...
research
07/18/2023

Interpretable Timbre Synthesis using Variational Autoencoders Regularized on Timbre Descriptors

Controllable timbre synthesis has been a subject of research for several...
research
08/04/2020

Timbre latent space: exploration and creative aspects

Recent studies show the ability of unsupervised models to learn invertib...
research
04/25/2021

Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis

Speech synthesis and music audio generation from symbolic input differ i...
research
10/27/2022

Synthesizer Preset Interpolation using Transformer Auto-Encoders

Sound synthesizers are widespread in modern music production but they in...
research
04/12/2019

Assisted Sound Sample Generation with Musical Conditioning in Adversarial Auto-Encoders

Generative models have thrived in computer vision, enabling unprecedente...
research
07/21/2022

A Proposal for Foley Sound Synthesis Challenge

"Foley" refers to sound effects that are added to multimedia during post...

Please sign up or login with your details

Forgot password? Click here to reset