Fine-grained Attention-based Video Face Recognition

05/06/2019
by Zhaoxiang Liu, et al.

This paper aims to learn a compact representation of a video for the video face recognition task. We make the following contributions. First, we propose a meta attention-based aggregation scheme that adaptively weighs the features at a fine-grained level, along each feature dimension across all frames, to form a compact and discriminative representation. It makes the most of the valuable or discriminative parts of each frame to improve face recognition performance, without discarding or disregarding low-quality frames as usual methods do. Second, we build a feature aggregation network composed of a feature embedding module and a feature aggregation module. The embedding module is a convolutional neural network that extracts a feature vector from a face image, while the aggregation module consists of two cascaded meta attention blocks that adaptively aggregate the feature vectors into a single fixed-length representation. The network can handle an arbitrary number of frames and is insensitive to frame order. Third, we validate the performance of the proposed aggregation scheme. Experiments on publicly available datasets, such as the YouTube Face dataset and the IJB-A dataset, show the effectiveness of our method, which achieves competitive performance on both the verification and identification protocols.
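The per-dimension weighting described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not specify how attention scores are computed, so the linear scoring layer (`W`, `b`), the function name `aggregate`, and the use of a single block rather than two cascaded blocks are all assumptions made for illustration. The sketch only shows the general idea of assigning one softmax weight per frame and per feature dimension, then summing over frames.

```python
import numpy as np

def aggregate(features, W, b):
    """Aggregate N frame features of shape (N, D) into one vector of shape (D,).

    W (D, D) and b (D,) form a hypothetical linear scoring layer; the
    paper's actual scoring function is not given in the abstract.
    """
    scores = features @ W + b                          # (N, D): one score per frame and dimension
    scores = scores - scores.max(axis=0, keepdims=True)  # stabilize the softmax numerically
    weights = np.exp(scores)
    weights /= weights.sum(axis=0, keepdims=True)      # softmax over frames, separately per dimension
    return (weights * features).sum(axis=0)            # weighted sum over frames

rng = np.random.default_rng(0)
D = 8                                   # illustrative feature size
W = rng.normal(size=(D, D))             # random stand-in for learned parameters
b = np.zeros(D)

video = rng.normal(size=(5, D))         # stand-in for 5 embedded frames
rep = aggregate(video, W, b)
rep_reversed = aggregate(video[::-1], W, b)
print(rep.shape, np.allclose(rep, rep_reversed))
```

Because the softmax normalizes over the frame axis and the result is a sum over frames, the output length stays fixed at `D` for any number of frames, and reordering the frames leaves the result unchanged, which matches the two properties the abstract claims for the network.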


research
03/17/2016

Neural Aggregation Network for Video Face Recognition

This paper presents a Neural Aggregation Network (NAN) for video face re...
research
10/11/2020

Self-attention aggregation network for video face representation and recognition

Models based on self-attention mechanisms have been successful in analyz...
research
06/29/2019

Frame attention networks for facial expression recognition in videos

The video-based facial expression recognition aims to classify a given v...
research
04/26/2019

Recurrent Embedding Aggregation Network for Video Face Recognition

Recurrent networks have been successful in analyzing temporal data and h...
research
08/13/2023

Improving Face Recognition from Caption Supervision with Multi-Granular Contextual Feature Aggregation

We introduce caption-guided face recognition (CGFR) as a new framework t...
research
10/26/2018

Fine-grained Video Categorization with Redundancy Reduction Attention

For fine-grained categorization tasks, videos could serve as a better so...
research
03/22/2016

Input Aggregated Network for Face Video Representation

Recently, deep neural network has shown promising performance in face im...
