Ming Lei

research

∙ 04/26/2022

Fast Successive-Cancellation Decoding of Polar Codes with Sequence Nodes

Due to the sequential nature of the successive-cancellation (SC) algorit...

0 Yang Lu, et al. ∙

research

∙ 02/16/2022

ProsoSpeech: Enhancing Prosody With Quantized Vector Pre-training in Text-to-Speech

Expressive text-to-speech (TTS) has become a hot research topic recently...

0 Yi Ren, et al. ∙

research

∙ 11/28/2021

Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual Information

Overlapping speech diarization is always treated as a multi-label classi...

0 Zhihao Du, et al. ∙

research

∙ 10/14/2021

FedSpeech: Federated Text-to-Speech with Continual Learning

Federated learning enables collaborative training of machine learning mo...

0 Ziyue Jiang, et al. ∙

research

∙ 09/09/2021

BeamTransformer: Microphone Array-based Overlapping Speech Detection

We propose BeamTransformer, an efficient architecture to leverage beamfo...

0 Siqi Zheng, et al. ∙

research

∙ 06/17/2021

EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model

Recently, there has been an increasing interest in neural speech synthes...

0 Chenye Cui, et al. ∙

research

∙ 04/06/2021

Extremely Low Footprint End-to-End ASR System for Smart Device

Recently, end-to-end (E2E) speech recognition has become popular, since ...

0 Zhifu Gao, et al. ∙

research

∙ 10/29/2020

DeviceTTS: A Small-Footprint, Fast, Stable Network for On-Device Text-to-Speech

With the number of smart devices increasing, the demand for on-device te...

0 Zhiying Huang, et al. ∙

research

∙ 10/27/2020

Universal ASR: Unifying Streaming and Non-Streaming ASR Using a Single Encoder-Decoder Model

Recently, online end-to-end ASR has gained increasing attention. However...

0 Zhifu Gao, et al. ∙

research

∙ 06/11/2020

A PDD Decoder for Binary Linear Codes With Neural Check Polytope Projection

Linear Programming (LP) is an important decoding technique for binary li...

0 Yi Wei, et al. ∙

research

∙ 05/21/2020

Simplified Self-Attention for Transformer-based End-to-End Speech Recognition

Transformer models have been introduced into end-to-end speech recogniti...

0 Haoneng Luo, et al. ∙

research

∙ 05/21/2020

Streaming Chunk-Aware Multihead Attention for Online End-to-End Speech Recognition

Recently, streaming end-to-end automatic speech recognition (E2E-ASR) ha...

0 Shiliang Zhang, et al. ∙

research

∙ 05/21/2020

SAN-M: Memory Equipped Self-Attention for End-to-End Speech Recognition

End-to-end speech recognition has become popular in recent years, since ...

0 Zhifu Gao, et al. ∙

research

∙ 02/14/2020

ADMM-based Decoder for Binary Linear Codes Aided by Deep Learning

Inspired by the recent advances in deep learning (DL), this work present...

0 Yi Wei, et al. ∙

research

∙ 06/10/2019

Learned Conjugate Gradient Descent Network for Massive MIMO Detection

In this work, we consider the use of model-driven deep learning techniqu...

0 Yi Wei, et al. ∙

research

∙ 03/27/2019

Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition

Connectionist Temporal Classification (CTC) based end-to-end speech reco...

0 Shiliang Zhang, et al. ∙

research

∙ 03/05/2018

Linear networks based speaker adaptation for speech synthesis

Speaker adaptation methods aim to create fair quality synthesis speech v...

0 Zhiying Huang, et al. ∙

research

∙ 03/04/2018

Deep-FSMN for Large Vocabulary Continuous Speech Recognition

In this paper, we present an improved feedforward sequential memory netw...

0 Shiliang Zhang, et al. ∙

research

∙ 02/26/2018

Deep Feed-forward Sequential Memory Networks for Speech Synthesis

The Bidirectional LSTM (BLSTM) RNN based speech synthesis system is amon...

0 Mengxiao Bi, et al. ∙

research

∙ 06/04/2017

Data preprocessing methods for robust Fourier ptychographic microscopy

Fourier ptychographic microscopy (FPM) is a recently proposed computatio...

0 Yan Zhang, et al. ∙

Ming Lei

Featured Co-authors

Sign in with Google

Consider DeepAI Pro