Pedro Moreno | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Bo Li
461 publications
Yu Zhang
406 publications
Yonghui Wu
72 publications
Wei Han
71 publications
Tara N. Sainath
66 publications
Rohit Prabhavalkar
49 publications
Ankur Bapna
47 publications
Zhong Meng
44 publications
Chung-Cheng Chiu
42 publications
Bhuvana Ramabhadran
36 publications
Françoise Beaufays
36 publications

research

∙ 03/02/2023

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

We introduce the Universal Speech Model (USM), a single large model that...

0 Yu Zhang, et al. ∙

research

∙ 10/18/2022

Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR

Training state-of-the-art Automated Speech Recognition (ASR) models typi...

0 Zhehuai Chen, et al. ∙

research

∙ 04/07/2022

MAESTRO: Matched Speech Text Representations through Modality Matching

We present Maestro, a self-supervised training method to unify represent...

0 Zhehuai Chen, et al. ∙

research

∙ 02/24/2022

Ask2Mask: Guided Data Selection for Masked Speech Modeling

Masked speech modeling (MSM) methods such as wav2vec2 or w2v-BERT learn ...

0 Murali Karthick Baskar, et al. ∙

research

∙ 08/27/2021

Injecting Text in Self-Supervised Speech Pretraining

Self-supervised pretraining for Automated Speech Recognition (ASR) has s...

0 Zhehuai Chen, et al. ∙

research

∙ 09/25/2019

Speech Recognition with Augmented Synthesized Speech

Recent success of the Tacotron speech synthesis architecture and its var...

0 Andrew Rosenberg, et al. ∙

research

∙ 09/24/2018

From Audio to Semantics: Approaches to end-to-end spoken language understanding

Conventional spoken language understanding systems consist of two main c...

0 Parisa Haghani, et al. ∙

research

∙ 11/06/2017

Multilingual Speech Recognition With A Single End-To-End Model

Training a conventional automatic speech recognition (ASR) system to sup...

0 Shubham Toshniwal, et al. ∙

Success!

An error occurred