Landmark Guidance Independent Spatio-channel Attention and Complementary Context Information based Facial Expression Recognition

07/20/2020
by   Darshan Gera, et al.
8

A recent trend to recognize facial expressions in the real-world scenario is to deploy attention based convolutional neural networks (CNNs) locally to signify the importance of facial regions and, combine it with global facial features and/or other complementary context information for performance gain. However, in the presence of occlusions and pose variations, different channels respond differently, and further that the response intensity of a channel differ across spatial locations. Also, modern facial expression recognition(FER) architectures rely on external sources like landmark detectors for defining attention. Failure of landmark detector will have a cascading effect on FER. Additionally, there is no emphasis laid on the relevance of features that are input to compute complementary context information. Leveraging on the aforementioned observations, an end-to-end architecture for FER is proposed in this work that obtains both local and global attention per channel per spatial location through a novel spatio-channel attention net (SCAN), without seeking any information from the landmark detectors. SCAN is complemented by a complementary context information (CCI) branch. Further, using efficient channel attention (ECA), the relevance of features input to CCI is also attended to. The representation learnt by the proposed architecture is robust to occlusions and pose variations. Robustness and superior performance of the proposed model is demonstrated on both in-lab and in-the-wild datasets (AffectNet, FERPlus, RAF-DB, FED-RO, SFEW, CK+, Oulu-CASIA and JAFFE) along with a couple of constructed face mask datasets resembling masked faces in COVID-19 scenario. Codes will be made publicly available.

READ FULL TEXT

page 4

page 13

page 17

page 18

page 19

research
09/29/2020

Affect Expression Behaviour Analysis in the Wild using Spatio-Channel Attention and Complementary Context Information

Facial expression recognition(FER) in the wild is crucial for building r...
research
03/28/2021

Imponderous Net for Facial Expression Recognition in the Wild

Since the renaissance of deep learning (DL), facial expression recogniti...
research
05/10/2019

Region Attention Networks for Pose and Occlusion Robust Facial Expression Recognition

Occlusion and pose variations, which can change facial appearance signif...
research
05/05/2023

LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition

Previous methods for dynamic facial expression recognition (DFER) in the...
research
03/31/2021

Robust Facial Expression Recognition with Convolutional Visual Transformers

Facial Expression Recognition (FER) in the wild is extremely challenging...
research
12/23/2021

Robust and Precise Facial Landmark Detection by Self-Calibrated Pose Attention Network

Current fully-supervised facial landmark detection methods have progress...
research
02/28/2019

PFLD: A Practical Facial Landmark Detector

Being accurate, efficient, and compact is essential to a facial landmark...

Please sign up or login with your details

Forgot password? Click here to reset