DualLip: A System for Joint Lip Reading and Generation

09/12/2020
by Weicong Chen, et al.

Lip reading aims to recognize text from talking lips, while lip generation aims to synthesize talking lips from text; lip generation is a key component of talking face generation and the dual task of lip reading. In this paper, we develop DualLip, a system that jointly improves lip reading and generation by leveraging the task duality and using unlabeled text and lip video data. The key ideas of DualLip include: 1) generating lip video from unlabeled text with a lip generation model, and using the pseudo pairs to improve lip reading; 2) generating text from unlabeled lip video with a lip reading model, and using the pseudo pairs to improve lip generation. We further extend DualLip to talking face generation with two additional components: lip-to-face generation and text-to-speech generation. Experiments on GRID and TCD-TIMIT demonstrate the effectiveness of DualLip in improving lip reading, lip generation, and talking face generation by utilizing unlabeled data. Specifically, the lip generation model in our DualLip system trained with only 10% paired data surpasses the performance of that trained with the whole paired data. On the GRID benchmark of lip reading, we achieve 1.16% character error rate and 2.71% word error rate, outperforming the state-of-the-art models using the same amount of paired data.
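The two key ideas above can be illustrated with a minimal sketch. This is not the authors' implementation: the function `build_pseudo_pairs`, the `Video` type, and the toy models `toy_generator` and `toy_reader` are hypothetical stand-ins, chosen only to show how each model's output on unlabeled data becomes pseudo-paired training data for the other task.

```python
from typing import Callable, List, Tuple

# Hypothetical stand-in: a "video" is just a list of numbers here.
Video = List[float]
Pair = Tuple[str, Video]  # (text, lip video)


def build_pseudo_pairs(
    unlabeled_text: List[str],
    unlabeled_video: List[Video],
    lip_generator: Callable[[str], Video],
    lip_reader: Callable[[Video], str],
) -> Tuple[List[Pair], List[Pair]]:
    """Create pseudo pairs in both directions from unlabeled data."""
    # 1) text -> generated video: pseudo pairs used to train lip reading.
    pairs_for_reading = [(t, lip_generator(t)) for t in unlabeled_text]
    # 2) video -> predicted text: pseudo pairs used to train lip generation.
    pairs_for_generation = [(lip_reader(v), v) for v in unlabeled_video]
    return pairs_for_reading, pairs_for_generation


# Toy models (assumptions for illustration): encode characters as numbers
# and decode them back. Real models would be neural networks.
def toy_generator(text: str) -> Video:
    return [float(ord(c)) for c in text]


def toy_reader(video: Video) -> str:
    return "".join(chr(int(x)) for x in video)


reading_pairs, generation_pairs = build_pseudo_pairs(
    ["bin blue"], [[104.0, 105.0]], toy_generator, toy_reader
)
```

In a real system each set of pseudo pairs would be mixed with the available paired data to retrain the opposite model; how the two are combined and scheduled is not specified in this abstract.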

