Daniel Korzekwa

research

∙ 07/31/2023

Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech

Neural text-to-speech systems are often optimized on L1/L2 losses, which...

0 Guangyan Zhang, et al. ∙

research

∙ 01/26/2023

On granularity of prosodic representations in expressive text-to-speech

In expressive speech synthesis it is widely adopted to use latent prosod...

0 Mikolaj Babianski, et al. ∙

research

∙ 09/13/2022

Automated detection of pronunciation errors in non-native English speech employing deep learning

Despite significant advances in recent years, the existing Computer-Assi...

0 Daniel Korzekwa, et al. ∙

research

∙ 07/02/2022

Computer-assisted Pronunciation Training – Speech synthesis is almost all you need

The research community has long studied computer-assisted pronunciation ...

0 Daniel Korzekwa, et al. ∙

research

∙ 03/15/2022

Text-free non-parallel many-to-many voice conversion using normalising flows

Non-parallel voice conversion (VC) is typically achieved using lossy rep...

0 Thomas Merritt, et al. ∙

research

∙ 08/13/2021

Enhancing audio quality for expressive Neural Text-to-Speech

Artificial speech synthesis has made a great leap in terms of naturalnes...

0 Abdelhamid Ezzerg, et al. ∙

research

∙ 06/24/2021

Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech

Whilst recent neural text-to-speech (TTS) approaches produce high-qualit...

0 Raahil Shah, et al. ∙

research

∙ 06/16/2021

Improving the expressiveness of neural vocoding with non-affine Normalizing Flows

This paper proposes a general enhancement to the Normalizing Flows (NF) ...

0 Adam Gabrys, et al. ∙

research

∙ 06/07/2021

Weakly-supervised word-level pronunciation error detection in non-native English speech

We propose a weakly-supervised model for word-level mispronunciation det...

0 Daniel Korzekwa, et al. ∙

research

∙ 02/01/2021

Universal Neural Vocoding with Parallel WaveNet

We present a universal neural vocoder based on Parallel WaveNet, with an...

0 Yunlong Jiao, et al. ∙

research

∙ 01/16/2021

Mispronunciation Detection in Non-native (L2) English with Uncertainty Modeling

A common approach to the automatic detection of mispronunciation in lang...

0 Daniel Korzekwa, et al. ∙

research

∙ 12/29/2020

Detection of Lexical Stress Errors in Non-native (L2) English with Data Augmentation and Attention

This paper describes two novel complementary techniques that improve the...

0 Daniel Korzekwa, et al. ∙

research

∙ 07/10/2019

Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech

This paper proposed a novel approach for the detection and reconstructio...

0 Daniel Korzekwa, et al. ∙

research

∙ 11/15/2018

Comprehensive evaluation of statistical speech waveform synthesis

Statistical TTS systems that directly predict the speech waveform have r...

0 Thomas Merritt, et al. ∙

Daniel Korzekwa

Featured Co-authors

Sign in with Google

Consider DeepAI Pro