b'Vimal Manohar'

research

∙ 06/23/2023

Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale

Large-scale generative models such as GPT and DALL-E have revolutionized...

0 Matthew Le, et al. ∙

research

∙ 11/23/2022

Voice-preserving Zero-shot Multiple Accent Conversion

Most people who have tried to learn a foreign language would have experi...

0 Mumin Jin, et al. ∙

research

∙ 10/28/2022

Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders

Text-based voice editing (TBVE) uses synthetic output from text-to-speec...

5 Jason Fong, et al. ∙

research

∙ 07/09/2021

On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models

Hybrid automatic speech recognition (ASR) models are typically sequentia...

0 Xiaohui Zhang, et al. ∙

research

∙ 06/14/2021

Kaizen: Continuously improving teacher using Exponential Moving Average for semi-supervised speech recognition

In this paper, we introduce the Kaizen framework that uses a continuousl...

0 Vimal Manohar, et al. ∙

research

∙ 05/16/2020

Large scale weakly and semi-supervised learning for low-resource video ASR

Many semi- and weakly-supervised approaches have been investigated for o...

0 Kritika Singh, et al. ∙

research

∙ 02/23/2018

The JHU Speech LOREHLT 2017 System: Cross-Language Transfer for Situation-Frame Detection

We describe the system our team used during NIST's LoReHLT (Low Resource...

0 Matthew Wiesner, et al. ∙

research

∙ 06/12/2017

Acoustic data-driven lexicon learning based on a greedy pronunciation selection framework

Speech recognition systems for irregularly-spelled languages like Englis...

0 Xiaohui Zhang, et al. ∙

research

∙ 06/01/2017

Using of heterogeneous corpora for training of an ASR system

The paper summarizes the development of the LVCSR system built as a part...

0 Jan Trmal, et al. ∙

Vimal Manohar

Featured Co-authors

Sign in with Google

Consider DeepAI Pro