Sri Karlapati

research

∙ 09/04/2023

A Comparative Analysis of Pretrained Language Models for Text-to-Speech

State-of-the-art text-to-speech (TTS) systems have utilized pretrained l...

0 Marcel Granero Moya, et al. ∙

research

∙ 06/20/2023

eCat: An End-to-End Model for Multi-Speaker TTS Many-to-Many Fine-Grained Prosody Transfer

We present eCat, a novel end-to-end multispeaker model capable of: a) ge...

0 Ammar Abbas, et al. ∙

research

∙ 06/29/2022

Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody

Generating expressive and contextually appropriate prosody remains a cha...

0 Peter Makarov, et al. ∙

research

∙ 06/28/2022

Expressive, Variable, and Controllable Duration Modelling in TTS

Duration modelling has become an important research problem once more wi...

0 Ammar Abbas, et al. ∙

research

∙ 06/27/2022

CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer

In this paper, we present CopyCat2 (CC2), a novel model capable of: a) s...

0 Sri Karlapati, et al. ∙

research

∙ 06/29/2021

Multi-Scale Spectrogram Modelling for Neural Text-to-Speech

We propose a novel Multi-Scale Spectrogram (MSS) modelling approach to s...

0 Ammar Abbas, et al. ∙

research

∙ 06/14/2021

A learned conditional prior for the VAE acoustic space of a TTS system

Many factors influence speech yielding different renditions of a given s...

0 Penny Karanasou, et al. ∙

research

∙ 11/04/2020

Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech

In this paper, we introduce Kathaka, a model trained with a novel two-st...

0 Sri Karlapati, et al. ∙

research

∙ 04/30/2020

CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech

Prosody Transfer (PT) is a technique that aims to use the prosody from a...

0 Sri Karlapati, et al. ∙

Sri Karlapati

Featured Co-authors

Sign in with Google

Consider DeepAI Pro