In this work, we propose an error correction framework, named DiaCorrect...
In recent years, speaker diarization has attracted widespread attention....
Recent advances in cross-lingual text-to-speech (TTS) made it possible t...
We propose two improvements to target-speaker voice activity detection
(...
Attention based neural TTS is elegant speech synthesis pipeline and has ...
Peking Opera has been the most dominant form of Chinese performing art s...
Singing voice conversion is converting the timbre in the source singing ...
This paper investigates how to leverage a DurIAN-based average model to
...
In this paper, we propose the FeatherWave, yet another variant of WaveRN...
This paper presents a method that generates expressive singing voice of
...
We propose an algorithm that is capable of synthesizing high quality tar...
Singing voice conversion is to convert a singer's voice to another one's...
In this paper, we present a generic and robust multimodal synthesis syst...
Speaker adaptation methods aim to create fair quality synthesis speech v...
The Bidirectional LSTM (BLSTM) RNN based speech synthesis system is amon...