Deep Learning Face Attributes in the Wild

11/28/2014
by   Ziwei Liu, et al.
0

Predicting face attributes in the wild is challenging due to complex face variations. We propose a novel deep learning framework for attribute prediction in the wild. It cascades two CNNs, LNet and ANet, which are fine-tuned jointly with attribute tags, but pre-trained differently. LNet is pre-trained by massive general object categories for face localization, while ANet is pre-trained by massive face identities for attribute prediction. This framework not only outperforms the state-of-the-art with a large margin, but also reveals valuable facts on learning face representation. (1) It shows how the performances of face localization (LNet) and attribute prediction (ANet) can be improved by different pre-training strategies. (2) It reveals that although the filters of LNet are fine-tuned only with image-level attribute tags, their response maps over entire images have strong indication of face locations. This fact enables training LNet for face localization with only image-level annotations, but without face bounding boxes or landmarks, which are required by all attribute recognition works. (3) It also demonstrates that the high-level hidden neurons of ANet automatically discover semantic concepts after pre-training with massive face identities, and such concepts are significantly enriched after fine-tuning with attribute tags. Each attribute can be well explained with a sparse linear combination of these concepts.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 6

page 7

page 11

research
02/04/2016

Leveraging Mid-Level Deep Representations For Predicting Face Attributes in the Wild

Predicting facial attributes from faces in the wild is very challenging ...
research
02/12/2016

Face Attribute Prediction Using Off-the-Shelf CNN Features

Predicting attributes from face images in the wild is a challenging comp...
research
04/21/2016

Walk and Learn: Facial Attribute Representation Learning from Egocentric Video and Contextual Data

The way people look in terms of facial attributes (ethnicity, hair color...
research
06/03/2015

One-to-many face recognition with bilinear CNNs

The recent explosive growth in convolutional neural network (CNN) resear...
research
09/18/2023

Image-Text Pre-Training for Logo Recognition

Open-set logo recognition is commonly solved by first detecting possible...
research
11/30/2021

CLIP Meets Video Captioners: Attribute-Aware Representation Learning Promotes Accurate Captioning

For video captioning, "pre-training and fine-tuning" has become a de fac...
research
09/21/2016

FaceNet2ExpNet: Regularizing a Deep Face Recognition Net for Expression Recognition

Relatively small data sets available for expression recognition research...

Please sign up or login with your details

Forgot password? Click here to reset