Attended End-to-end Architecture for Age Estimation from Facial Expression Videos

11/23/2017
by   Wenjie Pei, et al.
0

The main challenges of age estimation from facial expression videos lie not only in the modeling of the static facial appearance, but also in the capturing of the temporal facial dynamics. Traditional techniques to this problem focus on constructing handcrafted features to explore the discriminative information contained in facial appearance and dynamics separately. This relies on sophisticated feature-refinement and framework-design. In this paper, we present an end-to-end architecture for age estimation which is able to simultaneously learn both the appearance and dynamics of age from raw videos of facial expressions. Specifically, we employ convolutional neural networks to extract effective latent appearance representations and feed them into recurrent networks to model the temporal dynamics. More importantly, we propose to leverage attention models for salience detection in both the spatial domain for each single image and the temporal domain for the whole video as well. We design a specific spatially-indexed attention mechanism among the convolutional layers to extract the salient facial regions in each individual image, and a temporal attention layer to assign attention weights to each frame. This two-pronged approach not only improves the performance by allowing the model to focus on informative frames and facial areas, but it also offers an interpretable correspondence between the spatial facial regions as well as temporal frames, and the task of age estimation. We demonstrate the strong performance of our model in experiments on a large, gender-balanced database with 400 subjects with ages spanning from 8 to 76 years. Experiments reveal that our model exhibits significant superiority over the state-of-the-art methods given sufficient training data.

READ FULL TEXT

page 7

page 9

page 10

research
06/29/2019

frame attention networks for facial expression recognition in videos

The video-based facial expression recognition aims to classify a given v...
research
10/18/2021

Domain Generalisation for Apparent Emotional Facial Expression Recognition across Age-Groups

Apparent emotional facial expression recognition has attracted a lot of ...
research
03/05/2015

Deep Temporal Appearance-Geometry Network for Facial Expression Recognition

Temporal information can provide useful features for recognizing facial ...
research
12/14/2022

Blood Oxygen Saturation Estimation from Facial Video via DC and AC components of Spatio-temporal Map

Peripheral blood oxygen saturation (SpO2), an indicator of oxygen levels...
research
09/21/2021

Towards a Real-Time Facial Analysis System

Facial analysis is an active research area in computer vision, with many...
research
11/29/2017

Interpretable Facial Relational Network Using Relational Importance

Human face analysis is an important task in computer vision. According t...
research
10/11/2019

Aff-Wild Database and AffWildNet

In the context of HCI, building an automatic system to recognize affect ...

Please sign up or login with your details

Forgot password? Click here to reset