Face Pyramid Vision Transformer

10/21/2022
by Khawar Islam, et al.

A novel Face Pyramid Vision Transformer (FPVT) is proposed to learn discriminative multi-scale facial representations for face recognition and verification. In FPVT, Face Spatial Reduction Attention (FSRA) and Dimensionality Reduction (FDR) layers are employed to make the feature maps compact, thus reducing computation. An Improved Patch Embedding (IPE) algorithm is proposed to exploit the benefits of CNNs in ViTs (e.g., shared weights, local context, and receptive fields) and to model everything from lower-level edges to higher-level semantic primitives. Within the FPVT framework, a Convolutional Feed-Forward Network (CFFN) is proposed that extracts locality information to learn low-level facial information. The proposed FPVT is evaluated on seven benchmark datasets and compared with ten existing state-of-the-art methods, including CNNs, pure ViTs, and convolutional ViTs. Despite having fewer parameters, FPVT demonstrates excellent performance compared with these methods. The project page is available at https://khawar-islam.github.io/fpvt/
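
The abstract does not spell out how FSRA reduces computation, so the following is only a rough sketch of the general idea behind spatial-reduction attention, not the authors' implementation. It assumes a PVT-style scheme in which keys and values are downsampled by a factor along each spatial axis while queries keep full resolution. The function name `attention_score_count`, the 28 x 28 token grid, and the reduction factor of 4 are all hypothetical, chosen purely for illustration.

```python
def attention_score_count(height, width, reduction=1):
    """Count query-key pairs for one attention head over a height x width token grid.

    Assumption (not taken from the abstract): keys and values are downsampled
    by `reduction` along each spatial axis, as in PVT-style spatial reduction,
    while queries stay at full resolution.
    """
    if height % reduction or width % reduction:
        raise ValueError("reduction must divide both height and width")
    num_queries = height * width
    num_keys = (height // reduction) * (width // reduction)
    return num_queries * num_keys


# Hypothetical 28 x 28 token grid, with and without a reduction factor of 4.
full = attention_score_count(28, 28)
reduced = attention_score_count(28, 28, reduction=4)
print(full, reduced, full // reduced)  # 614656 38416 16
```

Under these assumptions the attention map shrinks by the square of the reduction factor, which is the kind of saving that compact feature maps are meant to provide.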

research
02/03/2023

DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition

As a de facto solution, the vanilla Vision Transformers (ViTs) are encou...
research
03/27/2021

Face Transformer for Recognition

Recently there has been great interests of Transformer not only in NLP b...
research
03/01/2019

Pyramid Feature Selective Network for Saliency detection

Saliency detection is one of the basic challenges in computer vision. Ho...
research
09/16/2021

TANet: A new Paradigm for Global Face Super-resolution via Transformer-CNN Aggregation Network

Recently, face super-resolution (FSR) methods either feed whole face ima...
research
08/30/2021

Exploring and Improving Mobile Level Vision Transformers

We study the vision transformer structure in the mobile level in this pa...
research
07/24/2023

Robust face anti-spoofing framework with Convolutional Vision Transformer

Owing to the advances in image processing technology and large-scale dat...
research
12/28/2013

Shape Primitive Histogram: A Novel Low-Level Face Representation for Face Recognition

We further exploit the representational power of Haar wavelet and presen...
