Rodrigo Mira

DeepAI


Featured Co-authors

Björn W. Schuller
99 publications
Maja Pantic
82 publications
Pingchuan Ma
47 publications
Stavros Petridis
45 publications
Anurag Kumar
45 publications
Vamsi Krishna Ithapu
16 publications
Buye Xu
15 publications
Konstantinos Vougioukas
11 publications
Alexandros Haliassos
8 publications
Jacob Donley
8 publications
Nikita Drobyshev
3 publications

research

∙ 05/15/2023

Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models

Speech-driven animation has gained significant traction in recent years,...

Antoni Bigata Casademunt, et al.

research

∙ 12/12/2022

Jointly Learning Visual and Auditory Speech Representations from Raw Data

We present RAVEn, a self-supervised multi-modal approach to jointly lear...

Alexandros Haliassos, et al.

research

∙ 11/20/2022

LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders

Audio-visual speech enhancement aims to extract clean speech from a nois...

Rodrigo Mira, et al.

research

∙ 05/04/2022

SVTS: Scalable Video-to-Speech Synthesis

Video-to-speech synthesis (also known as lip-to-speech) refers to the tr...

Rodrigo Mira, et al.

research

∙ 01/18/2022

Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection

One of the most pressing challenges for the detection of face-manipulate...

Alexandros Haliassos, et al.

research

∙ 06/16/2021

LiRA: Learning Visual Speech Representations from Audio through Self-supervision

The large amount of audiovisual content being shared online today has dr...

Pingchuan Ma, et al.

research

∙ 04/27/2021

End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks

Video-to-speech is the process of reconstructing the audio speech from a...

Rodrigo Mira, et al.
