RED-Net: A Recurrent Encoder-Decoder Network for Video-based Face Alignment

01/17/2018
by   Xi Peng, et al.
0

We propose a novel method for real-time face alignment in videos based on a recurrent encoder-decoder network model. Our proposed model predicts 2D facial point heat maps regularized by both detection and regression loss, while uniquely exploiting recurrent learning at both spatial and temporal dimensions. At the spatial level, we add a feedback loop connection between the combined output response map and the input, in order to enable iterative coarse-to-fine face alignment using a single network model, instead of relying on traditional cascaded model ensembles. At the temporal level, we first decouple the features in the bottleneck of the network into temporal-variant factors, such as pose and expression, and temporal-invariant factors, such as identity information. Temporal recurrent learning is then applied to the decoupled temporal-variant features. We show that such feature disentangling yields better generalization and significantly more accurate results at test time. We perform a comprehensive experimental analysis, showing the importance of each component of our proposed model, as well as superior results over the state of the art and several variations of our method in standard datasets.

READ FULL TEXT

page 11

page 13

research
08/19/2016

A Recurrent Encoder-Decoder Network for Sequential Face Alignment

We propose a novel recurrent encoder-decoder network model for real-time...
research
12/06/2016

Video Ladder Networks

We present the Video Ladder Network (VLN) for efficiently generating fut...
research
02/04/2022

Multi-task head pose estimation in-the-wild

We present a deep learning-based multi-task approach for head pose estim...
research
05/08/2019

Deep Blind Video Decaptioning by Temporal Aggregation and Recurrence

Blind video decaptioning is a problem of automatically removing text ove...
research
05/22/2017

TricorNet: A Hybrid Temporal Convolutional and Recurrent Network for Video Action Segmentation

Action segmentation as a milestone towards building automatic systems to...
research
11/19/2019

Live Face De-Identification in Video

We propose a method for face de-identification that enables fully automa...
research
07/03/2022

Are 3D Face Shapes Expressive Enough for Recognising Continuous Emotions and Action Unit Intensities?

Recognising continuous emotions and action unit (AU) intensities from fa...

Please sign up or login with your details

Forgot password? Click here to reset