Voice command generation using Progressive Wavegans

03/13/2019
by   Thomas Wiest, et al.
7

Generative Adversarial Networks (GANs) have become exceedingly popular in a wide range of data-driven research fields, due in part to their success in image generation. Their ability to generate new samples, often from only a small amount of input data, makes them an exciting research tool in areas with limited data resources. One less-explored application of GANs is the synthesis of speech and audio samples. Herein, we propose a set of extensions to the WaveGAN paradigm, a recently proposed approach for sound generation using GANs. The aim of these extensions - preprocessing, Audio-to-Audio generation, skip connections and progressive structures - is to improve the human likeness of synthetic speech samples. Scores from listening tests with 30 volunteers demonstrated a moderate improvement (Cohen's d coefficient of 0.65) in human likeness using the proposed extensions compared to the original WaveGAN approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/12/2018

Synthesizing Audio with Generative Adversarial Networks

While Generative Adversarial Networks (GANs) have seen wide success at t...
research
04/15/2021

EnvGAN: Adversarial Synthesis of Environmental Sounds for Data Augmentation

The research in Environmental Sound Classification (ESC) has been progre...
research
10/25/2018

Reducing over-smoothness in speech synthesis using Generative Adversarial Networks

Speech synthesis is widely used in many practical applications. In recen...
research
05/08/2018

ReGAN: RE[LAX|BAR|INFORCE] based Sequence Generation using GANs

Generative Adversarial Networks (GANs) have seen steep ascension to the ...
research
04/01/2021

Collaborative Learning to Generate Audio-Video Jointly

There have been a number of techniques that have demonstrated the genera...
research
10/15/2020

The power of pictures: using ML assisted image generation to engage the crowd in complex socioscientific problems

Human-computer image generation using Generative Adversarial Networks (G...
research
12/17/2021

NFTGAN: Non-Fungible Token Art Generation Using Generative Adversarial Networks

Digital arts have gained an unprecedented level of popularity with the e...

Please sign up or login with your details

Forgot password? Click here to reset