HetEmotionNet: Two-Stream Heterogeneous Graph Recurrent Neural Network for Multi-modal Emotion Recognition

08/07/2021
by   Ziyu Jia, et al.
0

The research on human emotion under multimedia stimulation based on physiological signals is an emerging field, and important progress has been achieved for emotion recognition based on multi-modal signals. However, it is challenging to make full use of the complementarity among spatial-spectral-temporal domain features for emotion recognition, as well as model the heterogeneity and correlation among multi-modal signals. In this paper, we propose a novel two-stream heterogeneous graph recurrent neural network, named HetEmotionNet, fusing multi-modal physiological signals for emotion recognition. Specifically, HetEmotionNet consists of the spatial-temporal stream and the spatial-spectral stream, which can fuse spatial-spectral-temporal domain features in a unified framework. Each stream is composed of the graph transformer network for modeling the heterogeneity, the graph convolutional network for modeling the correlation, and the gated recurrent unit for capturing the temporal domain or spectral domain dependency. Extensive experiments on two real-world datasets demonstrate that our proposed model achieves better performance than state-of-the-art baselines.

READ FULL TEXT
research
09/09/2020

Multi-modal Attention for Speech Emotion Recognition

Emotion represents an essential aspect of human speech that is manifeste...
research
05/12/2017

Spatial-Temporal Recurrent Neural Network for Emotion Recognition

Emotion analysis is a crucial problem to endow artifact machines with re...
research
09/22/2018

Entropy-Assisted Multi-Modal Emotion Recognition Framework Based on Physiological Signals

As the result of the growing importance of the Human Computer Interface ...
research
12/11/2018

Face-Focused Cross-Stream Network for Deception Detection in Videos

Automated deception detection (ADD) from real-life videos is a challengi...
research
05/01/2023

Multi-scale Transformer-based Network for Emotion Recognition from Multi Physiological Signals

This paper presents an efficient Multi-scale Transformer-based approach ...
research
11/21/2019

MIMAMO Net: Integrating Micro- and Macro-motion for Video Emotion Recognition

Spatial-temporal feature learning is of vital importance for video emoti...

Please sign up or login with your details

Forgot password? Click here to reset