An Initial study on Birdsong Re-synthesis Using Neural Vocoders

09/21/2022
by   Rhythm Bhatia, et al.
0

Modern speech synthesis uses neural vocoders to model raw waveform samples directly. This increased versatility has expanded the scope of vocoders from speech to other domains, such as music. We address another interesting domain of bio-acoustics. We provide initial comparative analysis-resynthesis experiments of birdsong using traditional (WORLD) and two neural (WaveNet autoencoder, parallel WaveGAN) vocoders. Our subjective results indicate no difference in the three vocoders in terms of species discrimination (ABX test). Nonetheless, the WORLD vocoder samples were rated higher in terms of retaining bird-like qualities (MOS test). All vocoders faced issues with pitch and voicing. Our results indicate some of the challenges in processing low-quality wildlife audio data.

READ FULL TEXT
research
10/27/2019

Transferring neural speech waveform synthesizers to musical instrument sounds generation

Recent neural waveform synthesizers such as WaveNet, WaveGlow, and the n...
research
04/25/2021

Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis

Speech synthesis and music audio generation from symbolic input differ i...
research
11/16/2022

Conditional variational autoencoder to improve neural audio synthesis for polyphonic music sound

Deep generative models for audio synthesis have recently been significan...
research
11/01/2018

Neural Music Synthesis for Flexible Timbre Control

The recent success of raw audio waveform synthesis models like WaveNet m...
research
08/22/2023

Evaluation of the Speech Resynthesis Capabilities of the VoicePrivacy Challenge Baseline B1

Speaker anonymization systems continue to improve their ability to obfus...
research
08/20/2018

Fast Spectrogram Inversion using Multi-head Convolutional Neural Networks

We propose the multi-head convolutional neural network (MCNN) architectu...
research
03/31/2022

Manipulation of oral cancer speech using neural articulatory synthesis

We present an articulatory synthesis framework for the synthesis and man...

Please sign up or login with your details

Forgot password? Click here to reset