Audio super-resolution is a fundamental task that predicts high-frequenc...
While the performance of cross-lingual TTS based on monolingual corpora ...
Automatic dubbing, which generates a corresponding version of the input
...
Some recent studies have demonstrated the feasibility of single-stage ne...
Speech restoration aims to remove distortions in speech signals. Prior
m...
Although deep learning and end-to-end models have been widely used and s...
Speech super-resolution (SR) is the task of increasing the speech sampling rate ...
Dubbing is a post-production process of re-recording actors' dialogues, ...
With the increasing popularity of speech synthesis products, the industr...
Speech restoration aims to remove distortions in speech signals. Prior
m...
Attention-based neural TTS is an elegant speech synthesis pipeline and has ...
This paper investigates how to leverage a DurIAN-based average model to
...
In this paper, we propose FeatherWave, yet another variant of WaveRN...
Neural network-based vocoders, typically WaveNet, have achieved
spe...