Utterance-Based Audio Sentiment Analysis Learned by a Parallel Combination of CNN and LSTM

11/20/2018
by Ziqian Luo, et al.

Audio sentiment analysis is a popular research area that extends conventional text-based sentiment analysis and depends on the effectiveness of acoustic features extracted from speech. However, current work on audio sentiment analysis mainly focuses on extracting homogeneous acoustic features, or fails to fuse heterogeneous features effectively. In this paper, we propose an utterance-based deep neural network model that combines a Convolutional Neural Network (CNN) and a Long Short-Term Memory (LSTM) network in parallel to obtain representative features, termed the Audio Sentiment Vector (ASV), that maximally reflect the sentiment information in an audio recording. Specifically, our model is trained with utterance-level labels, and the ASV is extracted and fused from the two branches. In the CNN branch, spectrum graphs produced from the audio signals are fed as inputs, while in the LSTM branch the inputs include spectral features and cepstral coefficients extracted from dependent utterances in an audio recording. In addition, a Bidirectional Long Short-Term Memory (BiLSTM) network with an attention mechanism is used for feature fusion. Extensive experiments show that our model recognizes audio sentiment precisely and quickly, and demonstrate that our ASV outperforms traditional acoustic features and vectors extracted from other deep learning models. Furthermore, experimental results indicate that the proposed model outperforms the state-of-the-art approach by 9.33 on the MOSI dataset.
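To make the fusion idea concrete, the following is a minimal sketch of attention-weighted fusion of two branch vectors, written in plain Python. It is not the paper's BiLSTM-with-attention layer, whose details the abstract does not give. The dot-product scoring, the `query` vector, the function name `attention_fuse`, and the toy vectors `cnn_vec` and `lstm_vec` are all assumptions made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_fuse(branch_vectors, query):
    """Fuse equally sized vectors with dot-product attention.

    Hypothetical helper: each branch vector is scored against `query`,
    the scores are normalized with softmax, and the result is the
    weighted sum of the branch vectors.
    """
    scores = [sum(q * x for q, x in zip(query, vec)) for vec in branch_vectors]
    weights = softmax(scores)
    dim = len(branch_vectors[0])
    fused = [
        sum(w * vec[i] for w, vec in zip(weights, branch_vectors))
        for i in range(dim)
    ]
    return fused, weights


# Toy stand-ins for the outputs of the CNN and LSTM branches (made-up values).
cnn_vec = [0.2, 0.8, -0.1]
lstm_vec = [0.5, -0.3, 0.4]
query = [1.0, 0.0, 1.0]  # assumed: a learned parameter in a real model

fused, weights = attention_fuse([cnn_vec, lstm_vec], query)
```

Here `weights` sums to 1 and `fused` has the same dimension as each branch vector. In a trained model the query and the branch encoders would be learned jointly from the utterance-level labels rather than fixed by hand.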


