Singing Voice Synthesis Using Differentiable LPC and Glottal-Flow-Inspired Wavetables

06/29/2023
by Chin-Yun Yu, et al.

This paper introduces the GlOttal-flow LPC Filter (GOLF), a novel method for singing voice synthesis (SVS) that exploits the physical characteristics of the human voice using differentiable digital signal processing. GOLF employs a glottal model as the harmonic source and IIR filters to simulate the vocal tract, resulting in an interpretable and efficient approach. We show that it is competitive with state-of-the-art singing voice vocoders while requiring fewer synthesis parameters and less memory to train, and that it runs an order of magnitude faster at inference. Additionally, we demonstrate that GOLF can model the phase components of the human voice, which has immense potential for rendering and analysing singing voices in a differentiable manner. Our results highlight the effectiveness of incorporating the physical properties of the human voice mechanism into SVS and underscore the advantages of signal-processing-based approaches, which offer greater interpretability and efficiency in synthesis. Audio samples are available at https://yoyololicon.github.io/golf-demo/.
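The source-filter idea described in the abstract (a glottal-like harmonic source driven through an IIR filter standing in for the vocal tract) can be illustrated with a minimal sketch. This is not the authors' implementation: it uses plain NumPy/SciPy, so it is not differentiable; the filter is time-invariant; and the pulse shape, the function names (`make_pulse_table`, `wavetable_oscillator`, `formant_poles`, `all_pole_filter`), and the formant values are all assumptions chosen for illustration.

```python
import numpy as np
from scipy.signal import lfilter


def make_pulse_table(table_size=512, open_fraction=0.6):
    """One period of a toy glottal-flow-like pulse (hypothetical shape)."""
    n_open = int(table_size * open_fraction)
    table = np.zeros(table_size)
    # Raised-cosine bump for the "open" phase; flat for the "closed" phase.
    table[:n_open] = 0.5 * (1.0 - np.cos(2.0 * np.pi * np.arange(n_open) / n_open))
    return table - table.mean()  # subtract the mean so the source has no DC


def wavetable_oscillator(table, f0, sample_rate):
    """Read the table at a per-sample fundamental frequency f0 (in Hz)."""
    phase = np.cumsum(f0 / sample_rate) % 1.0      # normalised phase in [0, 1)
    pos = phase * len(table)
    idx = np.floor(pos).astype(int) % len(table)
    nxt = (idx + 1) % len(table)
    frac = pos - np.floor(pos)
    return (1.0 - frac) * table[idx] + frac * table[nxt]  # linear interpolation


def formant_poles(freqs_hz, bandwidths_hz, sample_rate):
    """Conjugate pole pairs for resonances at the given frequencies/bandwidths."""
    radius = np.exp(-np.pi * np.asarray(bandwidths_hz, dtype=float) / sample_rate)
    angle = 2.0 * np.pi * np.asarray(freqs_hz, dtype=float) / sample_rate
    p = radius * np.exp(1j * angle)
    return np.concatenate([p, np.conj(p)])


def all_pole_filter(source, poles, gain=1.0):
    """Apply a time-invariant all-pole (LPC-style) filter gain / A(z)."""
    a = np.real(np.poly(poles))  # denominator coefficients, with a[0] == 1
    return lfilter([gain], a, source)


# Example: one second of a 220 Hz tone shaped by three assumed formants.
sample_rate = 16000
f0 = np.full(sample_rate, 220.0)
source = wavetable_oscillator(make_pulse_table(), f0, sample_rate)
poles = formant_poles([700.0, 1200.0, 2600.0], [80.0, 90.0, 120.0], sample_rate)
voice = all_pole_filter(source, poles)
```

Keeping every pole strictly inside the unit circle keeps the all-pole filter stable. A differentiable, time-varying version of this structure would need the filter coefficients to change frame by frame and gradients to flow through the recursion, which this sketch does not attempt.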


research
08/29/2023

A Review of Differentiable Digital Signal Processing for Music & Speech Synthesis

The term "differentiable digital signal processing" describes a family o...
research
11/05/2022

VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer

End-to-end singing voice synthesis (SVS) model VISinger can achieve bett...
research
03/12/2021

Latent Space Explorations of Singing Voice Synthesis using DDSP

Machine learning based singing voice models require large datasets and l...
research
10/07/2021

Towards Universal Neural Vocoding with a Multi-band Excited WaveNet

This paper introduces the Multi-Band Excited WaveNet a neural vocoder fo...
research
06/19/2023

Vocal Timbre Effects with Differentiable Digital Signal Processing

We explore two approaches to creatively altering vocal timbre using Diff...
research
08/09/2022

DDSP-based Singing Vocoders: A New Subtractive-based Synthesizer and A Comprehensive Evaluation

A vocoder is a conditional audio generation model that converts acoustic...
research
11/02/2022

Singing Voice Synthesis with Vibrato Modeling and Latent Energy Representation

This paper proposes an expressive singing voice synthesis system by intr...
