Dual-branch Cross-Patch Attention Learning for Group Affect Recognition

12/14/2022
by   Hongxia Xie, et al.
0

Group affect refers to the subjective emotion that is evoked by an external stimulus in a group, which is an important factor that shapes group behavior and outcomes. Recognizing group affect involves identifying important individuals and salient objects among a crowd that can evoke emotions. Most of the existing methods are proposed to detect faces and objects using pre-trained detectors and summarize the results into group emotions by specific rules. However, such affective region selection mechanisms are heuristic and susceptible to imperfect faces and objects from the pre-trained detectors. Moreover, faces and objects on group-level images are often contextually relevant. There is still an open question about how important faces and objects can be interacted with. In this work, we incorporate the psychological concept called Most Important Person (MIP). It represents the most noteworthy face in the crowd and has an affective semantic meaning. We propose the Dual-branch Cross-Patch Attention Transformer (DCAT) which uses global image and MIP together as inputs. Specifically, we first learn the informative facial regions produced by the MIP and the global context separately. Then, the Cross-Patch Attention module is proposed to fuse the features of MIP and global context together to complement each other. With parameters less than 10x, the proposed DCAT outperforms state-of-the-art methods on two datasets of group valence prediction, GAF 3.0 and GroupEmoW datasets. Moreover, our proposed model can be transferred to another group affect task, group cohesion, and shows comparable results.

READ FULL TEXT

page 1

page 2

page 8

research
05/22/2019

Oculum afficit: Ocular Affect Recognition

Recognizing human affect and emotions is a problem that has a wide range...
research
12/13/2016

Finding Tiny Faces

Though tremendous strides have been made in object recognition, one of t...
research
01/21/2023

REDAffectiveLM: Leveraging Affect Enriched Embedding and Transformer-based Neural Language Model for Readers' Emotion Detection

Technological advancements in web platforms allow people to express and ...
research
01/04/2022

Short Range Correlation Transformer for Occluded Person Re-Identification

Occluded person re-identification is one of the challenging areas of com...
research
03/09/2022

Region-Aware Face Swapping

This paper presents a novel Region-Aware Face Swapping (RAFSwap) network...
research
11/07/2021

Global-Local Attention for Emotion Recognition

Human emotion recognition is an active research area in artificial intel...

Please sign up or login with your details

Forgot password? Click here to reset