Improving Bi-LSTM Performance for Indonesian Sentiment Analysis Using Paragraph Vector

09/12/2020
by   Ayu Purwarianti, et al.
0

Bidirectional Long Short-Term Memory Network (Bi-LSTM) has shown promising performance in sentiment classification task. It processes inputs as sequence of information. Due to this behavior, sentiment predictions by Bi-LSTM were influenced by words sequence and the first or last phrases of the texts tend to have stronger features than other phrases. Meanwhile, in the problem scope of Indonesian sentiment analysis, phrases that express the sentiment of a document might not appear in the first or last part of the document that can lead to incorrect sentiment classification. To this end, we propose the using of an existing document representation method called paragraph vector as additional input features for Bi-LSTM. This vector provides information context of the document for each sequence processing. The paragraph vector is simply concatenated to each word vector of the document. This representation also helps to differentiate ambiguous Indonesian words. Bi-LSTM and paragraph vector were previously used as separate methods. Combining the two methods has shown a significant performance improvement of Indonesian sentiment analysis model. Several case studies on testing data showed that the proposed method can handle the sentiment phrases position problem encountered by Bi-LSTM.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/21/2022

LSTM based models stability in the context of Sentiment Analysis for social media

Deep learning techniques have proven their effectiveness for Sentiment A...
research
10/17/2016

Cached Long Short-Term Memory Neural Networks for Document-Level Sentiment Classification

Recently, neural networks have achieved great success on sentiment class...
research
11/27/2018

Document classification using a Bi-LSTM to unclog Brazil's supreme court

The Brazilian court system is currently the most clogged up judiciary sy...
research
01/16/2018

Beyond Word Importance: Contextual Decomposition to Extract Interactions from LSTMs

The driving force behind the recent success of LSTMs has been their abil...
research
12/11/2015

Words are not Equal: Graded Weighting Model for building Composite Document Vectors

Despite the success of distributional semantics, composing phrases from ...
research
07/03/2019

Deep neural network-based classification model for Sentiment Analysis

The growing prosperity of social networks has brought great challenges t...
research
12/05/2017

The Effect of Negators, Modals, and Degree Adverbs on Sentiment Composition

Negators, modals, and degree adverbs can significantly affect the sentim...

Please sign up or login with your details

Forgot password? Click here to reset