Diffusion models have shown promising results in cross-modal generation
...
Contrastive learning has shown remarkable success in the field of multim...
Data is the lifeblood of modern machine learning systems, including for ...
Musical expression requires control of both what notes are played, and h...
Automated audio captioning aims to use natural language to describe the
...
False claims about COVID-19 vaccines can undermine public trust in ongoi...
Peking Opera has been the most dominant form of Chinese performing art s...
Singing voice conversion is converting the timbre in the source singing ...
This paper presents a method that generates expressive singing voice of
...
We propose an algorithm that is capable of synthesizing high quality tar...