Zhenyao Zhu

research

∙ 10/10/2018

Fully Supervised Speaker Diarization

In this paper, we propose a fully supervised speaker diarization approac...

Aonan Zhang, et al.

research

∙ 07/24/2017

Exploring Neural Transducers for End-to-End Speech Recognition

In this work, we perform an empirical comparison among the CTC, RNN-Tran...

Eric Battenberg, et al.

research

∙ 05/25/2017

Principled Hybrids of Generative and Discriminative Domain Adaptation

We propose a probabilistic framework for domain adaptation that blends b...

Han Zhao, et al.

research

∙ 05/11/2017

Reducing Bias in Production Speech Models

Replacing hand-engineered pipelines with end-to-end deep learning system...

Eric Battenberg, et al.

research

∙ 05/05/2017

Deep Speaker: an End-to-End Neural Speaker Embedding System

We present Deep Speaker, a neural speaker embedding system that maps utt...

Chao Li, et al.

research

∙ 03/01/2017

Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling

Most existing sequence labelling models rely on a fixed decomposition of...

Hairong Liu, et al.

research

∙ 03/31/2016

Learning Multiscale Features Directly From Waveforms

Deep learning has dramatically improved the performance of speech recogn...

Zhenyao Zhu, et al.

research

∙ 12/08/2015

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

We show that an end-to-end deep learning approach can be used to recogni...

Dario Amodei, et al.

research

∙ 09/11/2014

DeepID-Net: multi-stage and deformable deep convolutional neural networks for object detection

In this paper, we propose multi-stage and deformable deep convolutional ...

Wanli Ouyang, et al.

research

∙ 06/26/2014

Deep Learning Multi-View Representation for Face Recognition

Various factors, such as identities, views (poses), and illuminations, a...

Zhenyao Zhu, et al.

