We present the latest iteration of the voice conversion challenge (VCC)
...
Audio codec models are widely used in audio communication as a crucial
t...
Expressive text-to-speech (TTS) aims to synthesize different speaking st...
Expressive text-to-speech (TTS) can synthesize a new speaking style by
i...
Dysarthric speech reconstruction (DSR), which aims to improve the qualit...
Denoising diffusion probabilistic models (DDPMs) are expressive generati...
The task of few-shot style transfer for voice cloning in text-to-speech ...
Cross-speaker style transfer (CSST) in text-to-speech (TTS) synthesis ai...
Language understanding in speech-based systems have attracted much atten...
Singing voice conversion (SVC) is one promising technique which can enri...
This paper proposes VARA-TTS, a non-autoregressive (non-AR) text-to-spee...
This paper proposes an any-to-many location-relative, sequence-to-sequen...
Various forefront countermeasure methods for automatic speaker verificat...
High-performance spoofing countermeasure systems for automatic speaker
v...