Speech Sentiment Analysis via Pre-trained Features from End-to-end ASR Models

11/21/2019
by   Zhiyun Lu, et al.
0

In this paper, we propose to use pre-trained features from end-to-end ASR models to solve the speech sentiment analysis problem as a down-stream task. We show that end-to-end ASR features, which integrate both acoustic and text information from speech, achieve promising results. We use RNN with self-attention as the sentiment classifier, which also provides an easy visualization through attention weights to help interpret model predictions. We use well benchmarked IEMOCAP dataset and a new large-scale sentiment analysis dataset SWBD-senti for evaluation. Our approach improves the-state-of-the-art accuracy on IEMOCAP from 66.6 SWBD-senti with more than 49,500 utterances.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset