Evaluating Gammatone Frequency Cepstral Coefficients with Neural Networks for Emotion Recognition from Speech

06/23/2018
by   Gabrielle K. Liu, et al.
0

Current approaches to speech emotion recognition focus on speech features that can capture the emotional content of a speech signal. Mel Frequency Cepstral Coefficients (MFCCs) are one of the most commonly used representations for audio speech recognition and classification. This paper proposes Gammatone Frequency Cepstral Coefficients (GFCCs) as a potentially better representation of speech signals for emotion recognition. The effectiveness of MFCC and GFCC representations are compared and evaluated over emotion and intensity classification tasks with fully connected and recurrent neural network architectures. The results provide evidence that GFCCs outperform MFCCs in speech emotion recognition.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/11/2019

Deep Learning based Emotion Recognition System Using Speech Features and Transcriptions

This paper proposes a speech emotion recognition method based on speech ...
research
12/22/2019

Emotion Recognition from Speech

In this work, we conduct an extensive comparison of various approaches t...
research
03/30/2018

Reusing Neural Speech Representations for Auditory Emotion Recognition

Acoustic emotion recognition aims to categorize the affective state of t...
research
07/28/2022

Extending RNN-T-based speech recognition systems with emotion and language classification

Speech transcription, emotion recognition, and language identification a...
research
06/11/2019

Focal Loss based Residual Convolutional Neural Network for Speech Emotion Recognition

This paper proposes a Residual Convolutional Neural Network (ResNet) bas...
research
10/31/2021

Speech Emotion Recognition Using Quaternion Convolutional Neural Networks

Although speech recognition has become a widespread technology, inferrin...
research
03/06/2020

Multi-Time-Scale Convolution for Emotion Recognition from Speech Audio Signals

Robustness against temporal variations is important for emotion recognit...

Please sign up or login with your details

Forgot password? Click here to reset