Text-based speech editing (TSE) techniques are designed to enable users ...
Recently, there has been a growing interest in the field of controllable...
Zero-shot text-to-speech aims at synthesizing voices with unseen speech
...
Scaling text-to-speech to a large and wild dataset has been proven to be...
We are interested in a novel task, namely low-resource text-to-talking
a...
Various applications of voice synthesis have been developed independentl...
Stutter removal is an essential scenario in the field of speech editing....
Improving text representation has attracted much attention to achieve
ex...
Generating talking person portraits with arbitrary speech audio is a cru...
Generating photo-realistic video portrait with arbitrary speech audio is...
Polyphone disambiguation aims to capture accurate pronunciation knowledg...
Federated learning enables collaborative training of machine learning mo...