An Ensemble Framework of Voice-Based Emotion Recognition System for Films and TV Programs

03/03/2018
by   Fei Tao, et al.
0

Employing voice-based emotion recognition function in artificial intelligence (AI) product will improve the user experience. Most of researches that have been done only focus on the speech collected under controlled conditions. The scenarios evaluated in these research were well controlled. The conventional approach may fail when background noise or nonspeech filler exist. In this paper, we propose an ensemble framework combining several aspects of features from audio. The framework incorporates gender and speaker information relying on multi-task learning. Therefore it is able to dig and capture emotional information as much as possible. This framework is evaluated on multimodal emotion challenge (MEC) 2017 corpus which is close to real world. The proposed framework outperformed the best baseline system by 29.5 improvement).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/18/2020

Cross Lingual Cross Corpus Speech Emotion Recognition

The majority of existing speech emotion recognition models are trained a...
research
08/13/2017

Towards Speech Emotion Recognition "in the wild" using Aggregated Corpora and Deep Multi-Task Learning

One of the challenges in Speech Emotion Recognition (SER) "in the wild" ...
research
12/06/2019

What Do You Mean I'm Funny? Personalizing the Joke Skill of a Voice-Controlled Virtual Assistant

A considerable part of the success experienced by Voice-controlled virtu...
research
09/27/2017

Research on several key technologies in practical speech emotion recognition

In this dissertation the practical speech emotion recognition technology...
research
08/12/2018

Multimodal Local-Global Ranking Fusion for Emotion Recognition

Emotion recognition is a core research area at the intersection of artif...
research
09/22/2019

On Controlled DeEntanglement for Natural Language Processing

Latest addition to the toolbox of human species is Artificial Intelligen...
research
11/29/2018

Two-level Attention with Two-stage Multi-task Learning for Facial Emotion Recognition

Compared with facial emotion recognition on categorical model, the dimen...

Please sign up or login with your details

Forgot password? Click here to reset