In a controllable text generation dataset, there exist unannotated attri...
Talking face generation has been extensively investigated owing to its w...
We propose a method for scene-level sketch-to-photo synthesis with text ...
Recent text-to-speech (TTS) systems have achieved quality comparable to that ...
This paper proposes a hierarchical generative model with a multi-grained...
Dancing to music has been one of humanity's innate abilities since ancient times....