EmoRL: Continuous Acoustic Emotion Classification using Deep Reinforcement Learning

04/03/2018
by   Egor Lakomkin, et al.
0

Acoustically expressed emotions can make communication with a robot more efficient. Detecting emotions like anger could provide a clue for the robot indicating unsafe/undesired situations. Recently, several deep neural network-based models have been proposed which establish new state-of-the-art results in affective state evaluation. These models typically start processing at the end of each utterance, which not only requires a mechanism to detect the end of an utterance but also makes it difficult to use them in a real-time communication scenario, e.g. human-robot interaction. We propose the EmoRL model that triggers an emotion classification as soon as it gains enough confidence while listening to a person speaking. As a result, we minimize the need for segmenting the audio signal for classification and achieve lower latency as the audio signal is processed incrementally. The method is competitive with the accuracy of a strong baseline model, while allowing much earlier prediction.

READ FULL TEXT
research
03/30/2020

iCub: Learning Emotion Expressions using Human Reward

The purpose of the present study is to learn emotion expression represen...
research
01/03/2023

e-Inu: Simulating A Quadruped Robot With Emotional Sentience

Quadruped robots are currently used in industrial robotics as mechanical...
research
01/12/2020

Hyperparameters optimization for Deep Learning based emotion prediction for Human Robot Interaction

To enable humanoid robots to share our social space we need to develop t...
research
07/01/2016

Fractal Dimension Pattern Based Multiresolution Analysis for Rough Estimator of Person-Dependent Audio Emotion Recognition

As a general means of expression, audio analysis and recognition has att...
research
04/06/2018

On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks

Speech emotion recognition (SER) is an important aspect of effective hum...
research
04/25/2022

Real-time Speech Emotion Recognition Based on Syllable-Level Feature Extraction

Speech emotion recognition systems have high prediction latency because ...
research
07/05/2019

Jointly Aligning and Predicting Continuous Emotion Annotations

Time-continuous dimensional descriptions of emotions (e.g., arousal, val...

Please sign up or login with your details

Forgot password? Click here to reset