Songxiang Liu

research

∙ 06/26/2023

The Singing Voice Conversion Challenge 2023

We present the latest iteration of the voice conversion challenge (VCC) ...

0 Wen-Chin Huang, et al. ∙

research

∙ 05/04/2023

HiFi-Codec: Group-residual Vector quantization for High Fidelity Audio Codec

Audio codec models are widely used in audio communication as a crucial t...

0 Dongchao Yang, et al. ∙

research

∙ 01/31/2023

InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt

Expressive text-to-speech (TTS) aims to synthesize different speaking st...

0 Dongchao Yang, et al. ∙

research

∙ 11/04/2022

NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS

Expressive text-to-speech (TTS) can synthesize a new speaking style by i...

0 Dongchao Yang, et al. ∙

research

∙ 02/18/2022

Speaker Identity Preservation in Dysarthric Speech Reconstruction by Adversarial Speaker Adaptation

Dysarthric speech reconstruction (DSR), which aims to improve the qualit...

0 Disong Wang, et al. ∙

research

∙ 01/28/2022

DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Denoising diffusion probabilistic models (DDPMs) are expressive generati...

0 Songxiang Liu, et al. ∙

research

∙ 11/14/2021

Meta-Voice: Fast few-shot style transfer for expressive voice cloning using meta learning

The task of few-shot style transfer for voice cloning in text-to-speech ...

0 Songxiang Liu, et al. ∙

research

∙ 09/08/2021

Referee: Towards reference-free cross-speaker style transfer with low-quality data for expressive speech synthesis

Cross-speaker style transfer (CSST) in text-to-speech (TTS) synthesis ai...

0 Songxiang Liu, et al. ∙

research

∙ 08/30/2021

ASR-GLUE: A New Multi-task Benchmark for ASR-Robust Natural Language Understanding

Language understanding in speech-based systems have attracted much atten...

3 Lingyun Feng, et al. ∙

research

∙ 05/28/2021

DiffSVC: A Diffusion Probabilistic Model for Singing Voice Conversion

Singing voice conversion (SVC) is one promising technique which can enri...

0 Songxiang Liu, et al. ∙

research

∙ 02/12/2021

VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention

This paper proposes VARA-TTS, a non-autoregressive (non-AR) text-to-spee...

4 Peng Liu, et al. ∙

research

∙ 09/06/2020

Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling

This paper proposes an any-to-many location-relative, sequence-to-sequen...

0 Songxiang Liu, et al. ∙

research

∙ 03/06/2020

Defense against adversarial attacks on spoofing countermeasures of ASV

Various forefront countermeasure methods for automatic speaker verificat...

0 Haibin Wu, et al. ∙

research

∙ 10/19/2019

Adversarial Attacks on Spoofing Countermeasures of automatic speaker verification

High-performance spoofing countermeasure systems for automatic speaker v...

0 Songxiang Liu, et al. ∙

Songxiang Liu

Featured Co-authors

Sign in with Google

Consider DeepAI Pro