Multitask Multi-database Emotion Recognition

07/08/2021
by   Manh Tu Vu, et al.
0

In this work, we introduce our submission to the 2nd Affective Behavior Analysis in-the-wild (ABAW) 2021 competition. We train a unified deep learning model on multi-databases to perform two tasks: seven basic facial expressions prediction and valence-arousal estimation. Since these databases do not contains labels for all the two tasks, we have applied the distillation knowledge technique to train two networks: one teacher and one student model. The student model will be trained using both ground truth labels and soft labels derived from the pretrained teacher model. During the training, we add one more task, which is the combination of the two mentioned tasks, for better exploiting inter-task correlations. We also exploit the sharing videos between the two tasks of the AffWild2 database that is used in the competition, to further improve the performance of the network. Experiment results shows that the network have achieved promising results on the validation set of the AffWild2 database. Code and pretrained model are publicly available at https://github.com/glmanhtu/multitask-abaw-2021

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/10/2020

FAU, Facial Expressions, Valence and Arousal: A Multi-task Solution

In the paper, we aim to train a unified model that performs three tasks:...
research
02/10/2020

Multitask Emotion Recognition with Incomplete Labels

We train a unified model to perform three tasks: facial action unit dete...
research
07/08/2021

Feature Pyramid Network for Multi-task Affective Analysis

Affective Analysis is not a single task, and the valence-arousal value, ...
research
03/24/2022

Multitask Emotion Recognition Model with Knowledge Distillation and Task Discriminator

Due to the collection of big data and the development of deep learning, ...
research
02/09/2020

Two-Stream Aural-Visual Affect Analysis in the Wild

In this work we introduce our submission to the Affective Behavior Analy...
research
02/06/2023

Audio Representation Learning by Distilling Video as Privileged Information

Deep audio representation learning using multi-modal audio-visual data o...
research
07/09/2021

Seven Basic Expression Recognition Using ResNet-18

We propose to use a ResNet-18 architecture that was pre-trained on the F...

Please sign up or login with your details

Forgot password? Click here to reset