Manuel Sam Ribeiro

DeepAI

AI Chat AI Image Generator AI Video AI Music Voice Chat AI Photo Editor Math AI

Featured Co-authors

Fan Yang
225 publications
Steve Renals
40 publications
Gustav Eje Henter
40 publications
Jaime Lorenzo-Trueba
29 publications
Roberto Barra-Chicote
22 publications
Daniel Korzekwa
14 publications
Korin Richmond
13 publications
Thomas Merritt
11 publications
Piotr Bilinski
10 publications
Jing-Xuan Zhang
10 publications
Goeric Huybrechts
10 publications

research

∙ 07/31/2023

Multilingual context-based pronunciation learning for Text-to-Speech

Phonetic information and linguistic knowledge are an essential component...

0 Giulia Comini, et al. ∙

research

∙ 07/31/2023

Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech

Neural text-to-speech systems are often optimized on L1/L2 losses, which...

0 Guangyan Zhang, et al. ∙

research

∙ 07/31/2023

Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings

The Grapheme-to-Phoneme (G2P) task aims to convert orthographic input in...

0 Manuel Sam Ribeiro, et al. ∙

research

∙ 09/22/2022

Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks

Automatically predicting the outcome of subjective listening tests is a ...

0 Cassia Valentini-Botinhao, et al. ∙

research

∙ 07/29/2022

Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation

The availability of data in expressive styles across languages is limite...

0 Giulia Comini, et al. ∙

research

∙ 02/16/2022

Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module

State-of-the-art text-to-speech (TTS) systems require several hours of r...

0 Adam Gabrys, et al. ∙

research

∙ 02/10/2022

Cross-speaker style transfer for text-to-speech using data augmentation

We address the problem of cross-speaker style transfer for text-to-speec...

0 Manuel Sam Ribeiro, et al. ∙

research

∙ 05/31/2021

Automatic audiovisual synchronisation for ultrasound tongue imaging

Ultrasound tongue imaging is used to visualise the intra-oral articulato...

0 Aciel Eshky, et al. ∙

research

∙ 02/27/2021

Silent versus modal multi-speaker speech recognition from ultrasound and video

We investigate multi-speaker speech recognition from ultrasound images o...

0 Manuel Sam Ribeiro, et al. ∙

research

∙ 02/27/2021

Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors

Speech sound disorders are a common communication impairment in childhoo...

0 Manuel Sam Ribeiro, et al. ∙

research

∙ 11/19/2020

TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos

We present the Tongue and Lips corpus (TaL), a multi-speaker corpus of a...

0 Manuel Sam Ribeiro, et al. ∙

research

∙ 07/01/2019

UltraSuite: A Repository of Ultrasound and Acoustic Data from Child Speech Therapy Sessions

We introduce UltraSuite, a curated repository of ultrasound and acoustic...

0 Aciel Eshky, et al. ∙

research

∙ 07/01/2019

Ultrasound tongue imaging for diarization and alignment of child speech therapy sessions

We investigate the automatic processing of child speech therapy sessions...

0 Manuel Sam Ribeiro, et al. ∙

research

∙ 07/01/2019

Synchronising audio and ultrasound by learning cross-modal embeddings

Audiovisual synchronisation is the task of determining the time offset b...

2 Aciel Eshky, et al. ∙

research

∙ 07/01/2019

Speaker-independent classification of phonetic segments from raw ultrasound in child speech

Ultrasound tongue imaging (UTI) provides a convenient way to visualize t...

3 Manuel Sam Ribeiro, et al. ∙

Manuel Sam Ribeiro

Featured Co-authors

Sign in with Google

Consider DeepAI Pro