An Empirical Study on Sentiment Classification of Chinese Review using Word Embedding

11/05/2015
by   Yiou Lin, et al.
0

In this article, how word embeddings can be used as features in Chinese sentiment classification is presented. Firstly, a Chinese opinion corpus is built with a million comments from hotel review websites. Then the word embeddings which represent each comment are used as input in different machine learning methods for sentiment classification, including SVM, Logistic Regression, Convolutional Neural Network (CNN) and ensemble methods. These methods get better performance compared with N-gram models using Naive Bayes (NB) and Maximum Entropy (ME). Finally, a combination of machine learning methods is proposed which presents an outstanding performance in precision, recall and F1 score. After selecting the most useful methods to construct the combinational model and testing over the corpus, the final F1 score is 0.920.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/11/2020

A Precisely Xtreme-Multi Channel Hybrid Approach For Roman Urdu Sentiment Analysis

In order to accelerate the performance of various Natural Language Proce...
research
01/08/2021

Effect of Word Embedding Variable Parameters on Arabic Sentiment Analysis Performance

Social media such as Twitter, Facebook, etc. has led to a generated grow...
research
04/08/2021

Machine Learning Based on Natural Language Processing to Detect Cardiac Failure in Clinical Narratives

The purpose of the study presented herein is to develop a machine learni...
research
06/20/2018

Opinion Dynamics Modeling for Movie Review Transcripts Classification with Hidden Conditional Random Fields

In this paper, the main goal is to detect a movie reviewer's opinion usi...
research
01/01/2023

Is word segmentation necessary for Vietnamese sentiment classification?

To the best of our knowledge, this paper made the first attempt to answe...
research
01/02/2022

Succinct Differentiation of Disparate Boosting Ensemble Learning Methods for Prognostication of Polycystic Ovary Syndrome Diagnosis

Prognostication of medical problems using the clinical data by leveragin...
research
04/15/2020

Sentiment Analysis of Yelp Reviews: A Comparison of Techniques and Models

We use over 350,000 Yelp reviews on 5,000 restaurants to perform an abla...

Please sign up or login with your details

Forgot password? Click here to reset