Towards Pose-invariant Lip-Reading

11/14/2019
by   Shiyang Cheng, et al.
19

Lip-reading models have been significantly improved recently thanks to powerful deep learning architectures. However, most works focused on frontal or near frontal views of the mouth. As a consequence, lip-reading performance seriously deteriorates in non-frontal mouth views. In this work, we present a framework for training pose-invariant lip-reading models on synthetic data instead of collecting and annotating non-frontal data which is costly and tedious. The proposed model significantly outperforms previous approaches on non-frontal views while retaining the superior performance on frontal and near frontal mouth views. Specifically, we propose to use a 3D Morphable Model (3DMM) to augment LRW, an existing large-scale but mostly frontal dataset, by generating synthetic facial data in arbitrary poses. The newly derived dataset, is used to train a state-of-the-art neural network for lip-reading. We conducted a cross-database experiment for isolated word recognition on the LRS2 dataset, and reported an absolute improvement of 2.55 proposed approach becomes clearer in extreme poses where an absolute improvement of up to 20.64

READ FULL TEXT
research
07/13/2023

Improving 2D Human Pose Estimation across Unseen Camera Views with Synthetic Data

Human Pose Estimation is a thoroughly researched problem; however, most ...
research
01/23/2020

Lipreading using Temporal Convolutional Networks

Lip-reading has attracted a lot of research attention lately thanks to a...
research
06/28/2021

Efficient Realistic Data Generation Framework leveraging Deep Learning-based Human Digitization

The performance of supervised deep learning algorithms depends significa...
research
12/09/2017

SPP-Net: Deep Absolute Pose Regression with Synthetic Views

Image based localization is one of the important problems in computer vi...
research
11/15/2020

Learn an Effective Lip Reading Model without Pains

Lip reading, also known as visual speech recognition, aims to recognize ...
research
09/03/2022

Training Strategies for Improved Lip-reading

Several training strategies and temporal models have been recently propo...
research
01/08/2022

Image-based Automatic Dial Meter Reading in Unconstrained Scenarios

The replacement of analog meters with smart meters is costly, laborious,...

Please sign up or login with your details

Forgot password? Click here to reset