Goeric Huybrechts

research

∙ 06/13/2023

DCTX-Conformer: Dynamic context carry-over for low latency unified streaming and non-streaming Conformer

Conformer-based end-to-end models have become ubiquitous these days and ...

Goeric Huybrechts, et al.

research

∙ 04/18/2023

Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR

Recently, there has been an increasing interest in unifying streaming an...

Xilai Li, et al.

research

∙ 07/29/2022

Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation

The availability of data in expressive styles across languages is limite...

Giulia Comini, et al.

research

∙ 02/16/2022

Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module

State-of-the-art text-to-speech (TTS) systems require several hours of r...

Adam Gabrys, et al.

research

∙ 02/10/2022

Cross-speaker style transfer for text-to-speech using data augmentation

We address the problem of cross-speaker style transfer for text-to-speec...

Manuel Sam Ribeiro, et al.

research

∙ 06/24/2021

Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech

Whilst recent neural text-to-speech (TTS) approaches produce high-qualit...

Raahil Shah, et al.

research

∙ 01/14/2021

EmoCat: Language-agnostic Emotional Voice Conversion

Emotional voice conversion models adapt the emotion in speech without ch...

Bastian Schnell, et al.

research

∙ 11/11/2020

Low-resource expressive text-to-speech using data augmentation

While recent neural text-to-speech (TTS) systems perform remarkably well...

Goeric Huybrechts, et al.

research

∙ 12/11/2019

Voice Conversion for Whispered Speech Synthesis

We present an approach to synthesize whisper by applying a handcrafted s...

Marius Cotescu, et al.

research

∙ 03/04/2019

Traditional Machine Learning for Pitch Detection

Pitch detection is a fundamental problem in speech processing as F0 is u...

Thomas Drugman, et al.
