Zhao You

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Shinji Watanabe
239 publications
Jie Chen
218 publications
Dong Yu
160 publications
Helen Meng
108 publications
Zhiyong Wu
70 publications
Dan Su
60 publications
Sanjeev Khudanpur
45 publications
Junbo Zhang
45 publications
Chao Weng
40 publications
Yujun Wang
34 publications
Wei Zou
33 publications

research

∙ 09/04/2023

Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation

Mapping two modalities, speech and text, into a shared representation sp...

0 Jiaxu Zhu, et al. ∙

research

∙ 04/07/2022

3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition

Recently, Conformer based CTC/AED model has become a mainstream architec...

0 Zhao You, et al. ∙

research

∙ 11/23/2021

SpeechMoE2: Mixture-of-Experts Model with Improved Routing

Mixture-of-experts based acoustic models with dynamic routing mechanisms...

0 Zhao You, et al. ∙

research

∙ 06/13/2021

GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio

This paper introduces GigaSpeech, an evolving, multi-domain English spee...

0 Guoguo Chen, et al. ∙

research

∙ 05/07/2021

SpeechMoE: Scaling to Large Acoustic Models with Dynamic Routing Mixture of Experts

Recently, Mixture of Experts (MoE) based Transformer has shown promising...

0 Zhao You, et al. ∙

research

∙ 10/28/2019

DFSMN-SAN with Persistent Memory Model for Automatic Speech Recognition

Self-attention networks (SAN) have been introduced into automatic speech...

0 Zhao You, et al. ∙

research

∙ 07/09/2019

Teach an all-rounder with experts in different domains

In many automatic speech recognition (ASR) tasks, an ideal model has to ...

0 Zhao You, et al. ∙

Success!

An error occurred

Zhao You

Featured Co-authors

Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation

3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition

SpeechMoE2: Mixture-of-Experts Model with Improved Routing

GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio

SpeechMoE: Scaling to Large Acoustic Models with Dynamic Routing Mixture of Experts

DFSMN-SAN with Persistent Memory Model for Automatic Speech Recognition

Teach an all-rounder with experts in different domains

Sign in with Google

Consider DeepAI Pro