Context-Aware Prosody Correction for Text-Based Speech Editing

02/16/2021
by Max Morrison, et al.

Text-based speech editors expedite the process of editing speech recordings by permitting editing via intuitive cut, copy, and paste operations on a speech transcript. A major drawback of current systems, however, is that edited recordings often sound unnatural because of prosody mismatches around edited regions. In our work, we propose a new context-aware method for more natural-sounding text-based editing of speech. To do so, we 1) use a series of neural networks to generate salient prosody features that are dependent on the prosody of speech surrounding the edit and amenable to fine-grained user control; 2) use the generated features to control a standard pitch-shift and time-stretch method; and 3) apply a denoising neural network to remove artifacts induced by the signal manipulation to yield a high-fidelity result. We evaluate our approach using a subjective listening test, provide a detailed comparative analysis, and draw several insights from the results.
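As a rough illustration of step 2, the sketch below shows how generated prosody features could be turned into control values for a pitch-shift and time-stretch method. It is not the authors' implementation: the function names, the choice of a per-frame pitch contour in Hz and per-phoneme durations in seconds as the prosody features, and the convention that a non-positive pitch value marks an unvoiced frame are all assumptions made for this example.

```python
import math


def semitone_shift(source_f0_hz, target_f0_hz):
    """Per-frame pitch shift in semitones from source to target pitch.

    Assumption: a frame with f0 <= 0 is unvoiced, so no shift is applied.
    """
    shifts = []
    for src, tgt in zip(source_f0_hz, target_f0_hz):
        if src > 0 and tgt > 0:
            # 12 semitones per octave; an octave is a doubling of frequency.
            shifts.append(12.0 * math.log2(tgt / src))
        else:
            shifts.append(0.0)
    return shifts


def stretch_factors(source_durations_s, target_durations_s):
    """Per-phoneme duration ratio: above 1 lengthens, below 1 shortens."""
    return [tgt / src for src, tgt in zip(source_durations_s, target_durations_s)]


if __name__ == "__main__":
    # Hypothetical values: pitch of the edited region vs. generated target pitch.
    print(semitone_shift([100.0, 0.0, 200.0], [200.0, 150.0, 200.0]))
    # Hypothetical values: phoneme durations before and after correction.
    print(stretch_factors([0.25, 0.5], [0.5, 0.25]))
```

The resulting per-frame shifts and per-phoneme factors would then be passed to whichever signal-processing method performs the actual modification, after which the denoising step described in the abstract would be applied.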

