Multi-Channel Auto-Encoder for Speech Emotion Recognition

10/25/2018
by   Zefang Zong, et al.
0

Inferring emotion status from users' queries plays an important role to enhance the capacity in voice dialogues applications. Even though several related works obtained satisfactory results, the performance can still be further improved. In this paper, we proposed a novel framework named multi-channel auto-encoder (MTC-AE) on emotion recognition from acoustic information. MTC-AE contains multiple local DNNs based on different low-level descriptors with different statistics functions that are partly concatenated together, by which the structure is enabled to consider both local and global features simultaneously. Experiment based on a benchmark dataset IEMOCAP shows that our method significantly outperforms the existing state-of-the-art results, achieving 64.8% leave-one-speaker-out unweighted accuracy, which is 2.4% higher than the best result on this dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/21/2018

Speech Emotion Recognition Considering Local Dynamic Features

Recently, increasing attention has been directed to the study of the spe...
research
06/11/2019

Deep Learning based Emotion Recognition System Using Speech Features and Transcriptions

This paper proposes a speech emotion recognition method based on speech ...
research
10/30/2018

Deep Learning as Feature Encoding for Emotion Recognition

Deep learning is popular as an end-to-end framework extracting the promi...
research
06/02/2023

Learning Local to Global Feature Aggregation for Speech Emotion Recognition

Transformer has emerged in speech emotion recognition (SER) at present. ...
research
04/28/2020

Deep Auto-Encoders with Sequential Learning for Multimodal Dimensional Emotion Recognition

Multimodal dimensional emotion recognition has drawn a great attention f...
research
10/07/2021

SERAB: A multi-lingual benchmark for speech emotion recognition

Recent developments in speech emotion recognition (SER) often leverage d...
research
06/11/2018

Multilingual Scene Character Recognition System using Sparse Auto-Encoder for Efficient Local Features Representation in Bag of Features

The recognition of texts existing in camera-captured images has become a...

Please sign up or login with your details

Forgot password? Click here to reset