FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis

06/29/2021
by   Taejun Bak, et al.
0

Methods for modeling and controlling prosody with acoustic features have been proposed for neural text-to-speech (TTS) models. Prosodic speech can be generated by conditioning acoustic features. However, synthesized speech with a large pitch-shift scale suffers from audio quality degradation, and speaker characteristics deformation. To address this problem, we propose a feed-forward Transformer based TTS model that is designed based on the source-filter theory. This model, called FastPitchFormant, has a unique structure that handles text and acoustic features in parallel. With modeling each feature separately, the tendency that the model learns the relationship between two features can be mitigated.

READ FULL TEXT
research
04/25/2018

Speaker-independent raw waveform model for glottal excitation

Recent speech technology research has seen a growing interest in using W...
research
09/13/2023

Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms

Recent strides in neural speech synthesis technologies, while enjoying w...
research
11/19/2018

Limitations of Source-Filter Coupling In Phonation

The coupling of vocal fold (source) and vocal tract (filter) is one of t...
research
03/18/2022

A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing

Recently, speech representation learning has improved many speech-relate...
research
10/08/2020

Classification of Speech with and without Face Mask using Acoustic Features

The understanding and interpretation of speech can be affected by variou...
research
02/14/2023

Synthesizing audio from tongue motion during speech using tagged MRI via transformer

Investigating the relationship between internal tissue point motion of t...
research
04/16/2020

Knowledge-and-Data-Driven Amplitude Spectrum Prediction for Hierarchical Neural Vocoders

In our previous work, we have proposed a neural vocoder called HiNet whi...

Please sign up or login with your details

Forgot password? Click here to reset