Due to the sequential nature of the successive-cancellation (SC) algorit...
Expressive text-to-speech (TTS) has become a hot research topic recently...
Overlapping speech diarization is always treated as a multi-label classi...
Federated learning enables collaborative training of machine learning mo...
We propose BeamTransformer, an efficient architecture to leverage beamfo...
Recently, there has been an increasing interest in neural speech synthes...
Recently, end-to-end (E2E) speech recognition has become popular, since ...
With the number of smart devices increasing, the demand for on-device te...
Recently, online end-to-end ASR has gained increasing attention. However...
Linear Programming (LP) is an important decoding technique for binary li...
Transformer models have been introduced into end-to-end speech recogniti...
Recently, streaming end-to-end automatic speech recognition (E2E-ASR) ha...
End-to-end speech recognition has become popular in recent years, since ...
Inspired by the recent advances in deep learning (DL), this work present...
In this work, we consider the use of model-driven deep learning techniqu...
Connectionist Temporal Classification (CTC) based end-to-end speech reco...
Speaker adaptation methods aim to create fair quality synthesis speech v...
In this paper, we present an improved feedforward sequential memory netw...
The Bidirectional LSTM (BLSTM) RNN based speech synthesis system is amon...
Fourier ptychographic microscopy (FPM) is a recently proposed computatio...