StyleWaveGAN: Style-based synthesis of drum sounds with extensive controls using generative adversarial networks

04/02/2022
by Antoine Lavault, et al.

In this paper, we introduce StyleWaveGAN, a style-based drum sound generator that is a variation of StyleGAN, a state-of-the-art image generator. By conditioning StyleWaveGAN on both the type of drum and several audio descriptors, we can synthesize CD-quality waveforms of up to 1.5 s directly, faster than real time on a GPU, while retaining a considerable amount of control over the generation. We also introduce an alternative to the progressive growing of GANs and experiment with the effect of dataset balancing for generative tasks. The experiments are carried out on an augmented subset of a publicly available dataset comprising different drums and cymbals. We evaluate against two recent drum generators, WaveGAN and NeuroDrum, demonstrating significantly improved generation quality (measured with the Fréchet Audio Distance) and interesting results with perceptual features.
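
To make the "CD quality", "1.5 s" and "faster than real-time" figures concrete, the arithmetic can be sketched as below. This is only an illustration: it assumes "CD quality" means a 44.1 kHz sample rate, and the function names and the 0.5 s generation time are hypothetical, not values reported in the paper.

```python
# Assumption: "CD quality" is read as a 44.1 kHz sample rate.
CD_SAMPLE_RATE_HZ = 44_100


def num_samples(duration_s: float, sample_rate_hz: int = CD_SAMPLE_RATE_HZ) -> int:
    """Number of waveform samples needed for a sound of the given duration."""
    return round(duration_s * sample_rate_hz)


def real_time_factor(duration_s: float, generation_time_s: float) -> float:
    """Ratio of audio duration to generation time; above 1.0 is faster than real time."""
    if generation_time_s <= 0:
        raise ValueError("generation_time_s must be positive")
    return duration_s / generation_time_s


# A 1.5 s drum sound at 44.1 kHz corresponds to 66150 output samples.
print(num_samples(1.5))  # 66150

# Hypothetical timing: if generating 1.5 s of audio took 0.5 s,
# the generator would run at three times real time.
print(real_time_factor(1.5, 0.5))  # 3.0
```

Under these assumptions, a generator producing the longest supported sound must output 66150 samples per waveform, and any generation time below 1.5 s counts as faster than real time.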

research
08/27/2020

DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial Networks

Synthetic creation of drum sounds (e.g., in drum machines) is commonly p...
research
06/16/2020

Comparing Representations for Audio Synthesis Using Generative Adversarial Networks

In this paper, we compare different audio signal representations, includ...
research
08/29/2023

Learning Modulated Transformation in GANs

The success of style-based generators largely benefits from style modula...
research
12/10/2020

Slimmable Generative Adversarial Networks

Generative adversarial networks (GANs) have achieved remarkable progress...
research
10/14/2021

SpecSinGAN: Sound Effect Variation Synthesis Using Single-Image GANs

Single-image generative adversarial networks learn from the internal dis...
research
05/18/2020

Unconditional Audio Generation with Generative Adversarial Networks and Cycle Regularization

In a recent paper, we have presented a generative adversarial network (G...
research
08/03/2021

DarkGAN: Exploiting Knowledge Distillation for Comprehensible Audio Synthesis with GANs

Generative Adversarial Networks (GANs) have achieved excellent audio syn...