Heng Lu

research

∙ 09/15/2023

DiaCorrect: Error Correction Back-end For Speaker Diarization

In this work, we propose an error correction framework, named DiaCorrect...

0 Jiangyu Han, et al. ∙

research

∙ 10/31/2022

DiaCorrect: End-to-end error correction for speaker diarization

In recent years, speaker diarization has attracted widespread attention....

0 Jiangyu Han, et al. ∙

research

∙ 02/22/2022

Improving Cross-lingual Speech Synthesis with Triplet Training Scheme

Recent advances in cross-lingual text-to-speech (TTS) made it possible t...

0 Jianhao Ye, et al. ∙

research

∙ 02/10/2022

The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge

We propose two improvements to target-speaker voice activity detection (...

1 Maokui He, et al. ∙

research

∙ 11/02/2020

FeatherTTS: Robust and Efficient attention based Neural TTS

Attention based neural TTS is an elegant speech synthesis pipeline and has ...

0 Qiao Tian, et al. ∙

research

∙ 08/07/2020

Peking Opera Synthesis via Duration Informed Attention Network

Peking Opera has been the most dominant form of Chinese performing art s...

0 Yusong Wu, et al. ∙

research

∙ 08/07/2020

DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System

Singing voice conversion is converting the timbre in the source singing ...

0 Liqiang Zhang, et al. ∙

research

∙ 05/12/2020

AdaDurIAN: Few-shot Adaptation for Neural Text-to-Speech with DurIAN

This paper investigates how to leverage a DurIAN-based average model to ...

0 Zewang Zhang, et al. ∙

research

∙ 05/12/2020

FeatherWave: An efficient high-fidelity neural vocoder with multi-band linear prediction

In this paper, we propose the FeatherWave, yet another variant of WaveRN...

0 Qiao Tian, et al. ∙

research

∙ 12/27/2019

Synthesising Expressiveness in Peking Opera via Duration Informed Attention Network

This paper presents a method that generates expressive singing voice of ...

0 Yusong Wu, et al. ∙

research

∙ 12/20/2019

Learning Singing From Speech

We propose an algorithm that is capable of synthesizing high quality tar...

0 Liqiang Zhang, et al. ∙

research

∙ 12/04/2019

PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network

Singing voice conversion is to convert a singer's voice to another one's...

0 Chengqi Deng, et al. ∙

research

∙ 09/04/2019

DurIAN: Duration Informed Attention Network For Multimodal Synthesis

In this paper, we present a generic and robust multimodal synthesis syst...

0 Chengzhu Yu, et al. ∙

research

∙ 03/05/2018

Linear networks based speaker adaptation for speech synthesis

Speaker adaptation methods aim to create fair quality synthesis speech v...

0 Zhiying Huang, et al. ∙

research

∙ 02/26/2018

Deep Feed-forward Sequential Memory Networks for Speech Synthesis

The Bidirectional LSTM (BLSTM) RNN based speech synthesis system is amon...

0 Mengxiao Bi, et al. ∙
