Towards Accurate Facial Landmark Detection via Cascaded Transformers

08/23/2022
by   Hui Li, et al.
3

Accurate facial landmarks are essential prerequisites for many tasks related to human faces. In this paper, an accurate facial landmark detector is proposed based on cascaded transformers. We formulate facial landmark detection as a coordinate regression task such that the model can be trained end-to-end. With self-attention in transformers, our model can inherently exploit the structured relationships between landmarks, which would benefit landmark detection under challenging conditions such as large pose and occlusion. During cascaded refinement, our model is able to extract the most relevant image features around the target landmark for coordinate prediction, based on deformable attention mechanism, thus bringing more accurate alignment. In addition, we propose a novel decoder that refines image features and landmark positions simultaneously. With few parameter increasing, the detection performance improves further. Our model achieves new state-of-the-art performance on several standard facial landmark detection benchmarks, and shows good generalization ability in cross-dataset evaluation.

READ FULL TEXT

page 6

page 7

page 9

research
10/18/2020

Deep Structured Prediction for Facial Landmark Detection

Existing deep learning based facial landmark detection methods have achi...
research
07/08/2022

RePFormer: Refinement Pyramid Transformer for Robust Facial Landmark Detection

This paper presents a Refinement Pyramid Transformer (RePFormer) for rob...
research
11/26/2016

Convolutional Experts Constrained Local Model for Facial Landmark Detection

Constrained Local Models (CLMs) are a well-established family of methods...
research
10/09/2022

Less is More: Facial Landmarks can Recognize a Spontaneous Smile

Smile veracity classification is a task of interpreting social interacti...
research
03/19/2022

Multi-Domain Multi-Definition Landmark Localization for Small Datasets

We present a novel method for multi image domain and multi-landmark defi...
research
03/18/2018

Facial Landmarks Detection by Self-Iterative Regression based Landmarks-Attention Network

Cascaded Regression (CR) based methods have been proposed to solve facia...
research
02/04/2023

LipFormer: Learning to Lipread Unseen Speakers based on Visual-Landmark Transformers

Lipreading refers to understanding and further translating the speech of...

Please sign up or login with your details

Forgot password? Click here to reset