Srikanth Ronanki

research

∙ 06/13/2023

DCTX-Conformer: Dynamic context carry-over for low latency unified streaming and non-streaming Conformer

Conformer-based end-to-end models have become ubiquitous these days and ...

0 Goeric Huybrechts, et al. ∙

research

∙ 04/18/2023

Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR

Recently, there has been an increasing interest in unifying streaming an...

0 Xilai Li, et al. ∙

research

∙ 11/23/2022

Device Directedness with Contextual Cues for Spoken Dialog Systems

In this work, we define barge-in verification as a supervised learning t...

0 Dhanush Bekal, et al. ∙

research

∙ 10/18/2022

Personalization of CTC Speech Recognition Models

End-to-end speech recognition models trained using joint Connectionist T...

6 Saket Dingliwal, et al. ∙

research

∙ 04/21/2021

Adapting Long Context NLM for ASR Rescoring in Conversational Agents

Neural Language Models (NLM), when trained and evaluated with context sp...

13 Ashish Shenoy, et al. ∙

research

∙ 11/30/2020

Transformer-Transducers for Code-Switched Speech Recognition

We live in a world where 60 languages fluently. Members of these communi...

0 Siddharth Dalmia, et al. ∙

research

∙ 08/03/2020

Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech

In this work, we explore a multimodal semi-supervised learning approach ...

0 Monica Sunkara, et al. ∙

research

∙ 07/04/2020

Robust Prediction of Punctuation and Truecasing for Medical ASR

Automatic speech recognition (ASR) systems in the medical domain that fo...

0 Monica Sunkara, et al. ∙

research

∙ 07/04/2020

Robust Prediction of Punctuation and Truecasingfor Medical ASR

Automatic speech recognition (ASR) systems in the medical domain that fo...

0 Monica Sunkara, et al. ∙

research

∙ 11/05/2019

The ASVspoof 2019 database

Automatic speaker verification (ASV) is one of the most natural and conv...

0 Xin Wang, et al. ∙

research

∙ 07/04/2019

Fine-grained robust prosody transfer for single-speaker neural text-to-speech

We present a neural text-to-speech system for fine-grained prosody trans...

0 Viacheslav Klimkov, et al. ∙

research

∙ 04/04/2019

In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data

Neural text-to-speech synthesis (NTTS) models have shown significant pro...

0 Nishant Prateek, et al. ∙

research

∙ 08/22/2016

Median-Based Generation of Synthetic Speech Durations using a Non-Parametric Approach

This paper proposes a new approach to duration modelling for statistical...

0 Srikanth Ronanki, et al. ∙

research

∙ 08/18/2016

DNN-based Speech Synthesis for Indian Languages from ASCII text

Text-to-Speech synthesis in Indian languages has a seen lot of progress ...

0 Srikanth Ronanki, et al. ∙

Srikanth Ronanki

Featured Co-authors

Sign in with Google

Consider DeepAI Pro