CLIPER: A Unified Vision-Language Framework for In-the-Wild Facial Expression Recognition

03/01/2023
by   Hanting Li, et al.
0

Facial expression recognition (FER) is an essential task for understanding human behaviors. As one of the most informative behaviors of humans, facial expressions are often compound and variable, which is manifested by the fact that different people may express the same expression in very different ways. However, most FER methods still use one-hot or soft labels as the supervision, which lack sufficient semantic descriptions of facial expressions and are less interpretable. Recently, contrastive vision-language pre-training (VLP) models (e.g., CLIP) use text as supervision and have injected new vitality into various computer vision tasks, benefiting from the rich semantics in text. Therefore, in this work, we propose CLIPER, a unified framework for both static and dynamic facial Expression Recognition based on CLIP. Besides, we introduce multiple expression text descriptors (METD) to learn fine-grained expression representations that make CLIPER more interpretable. We conduct extensive experiments on several popular FER benchmarks and achieve state-of-the-art performance, which demonstrates the effectiveness of CLIPER.

READ FULL TEXT

page 1

page 7

research
01/17/2020

Learning to Augment Expressions for Few-shot Fine-grained Facial Expression Recognition

Affective computing and cognitive theory are widely used in modern human...
research
10/07/2016

Learning Grimaces by Watching TV

Differently from computer vision systems which require explicit supervis...
research
01/08/2021

Unobtrusive Pain Monitoring in Older Adults with Dementia using Pairwise and Contrastive Training

Although pain is frequent in old age, older adults are often undertreate...
research
07/22/2022

Facial Expression Recognition using Vanilla ViT backbones with MAE Pretraining

Humans usually convey emotions voluntarily or involuntarily by facial ex...
research
08/25/2023

Prompting Visual-Language Models for Dynamic Facial Expression Recognition

This paper presents a novel visual-language model called DFER-CLIP, whic...
research
09/21/2016

From Facial Expression Recognition to Interpersonal Relation Prediction

Interpersonal relation defines the association, e.g., warm, friendliness...
research
07/18/2023

LA-Net: Landmark-Aware Learning for Reliable Facial Expression Recognition under Label Noise

Facial expression recognition (FER) remains a challenging task due to th...

Please sign up or login with your details

Forgot password? Click here to reset