Sentiment Analysis of Yelp Reviews: A Comparison of Techniques and Models

04/15/2020
by Siqi Liu, et al.

We use over 350,000 Yelp reviews of 5,000 restaurants to perform an ablation study of text preprocessing techniques. We also compare how effectively several machine learning and deep learning models predict user sentiment (negative, neutral, or positive). For machine learning models, we find that using a binary bag-of-words representation, adding bigrams, imposing minimum frequency constraints, and normalizing text all have positive effects on model performance. For deep learning models, we find that using pre-trained word embeddings and capping the maximum length often boost model performance. Finally, using macro F1 score as our comparison metric, we find that simpler models such as Logistic Regression and Support Vector Machine predict sentiment more effectively than more complex models such as Gradient Boosting, LSTM, and BERT.
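To make the preprocessing choices and the evaluation metric named above concrete, here is a minimal sketch in plain Python. It is not the authors' code: the function names, the toy reviews, the regex used for normalization, and the `min_df=2` threshold are all assumptions chosen for illustration. It shows text normalization, unigram plus bigram extraction, a minimum document frequency cutoff, binary (presence/absence) features, and a macro-averaged F1 score over the three sentiment classes.

```python
import re
from collections import Counter


def normalize(text):
    """Lowercase the text and keep alphabetic tokens (one simple normalization)."""
    return re.findall(r"[a-z']+", text.lower())


def ngrams(tokens, max_n=2):
    """Return all n-grams from unigrams up to max_n, joined with spaces."""
    out = []
    for n in range(1, max_n + 1):
        out.extend(" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1))
    return out


def build_vocab(docs, min_df=2):
    """Keep terms that occur in at least min_df documents (hypothetical threshold)."""
    doc_freq = Counter()
    for doc in docs:
        doc_freq.update(set(ngrams(normalize(doc))))
    return sorted(term for term, count in doc_freq.items() if count >= min_df)


def binary_bow(doc, vocab):
    """Binary bag-of-words: 1 if the term is present, regardless of its count."""
    present = set(ngrams(normalize(doc)))
    return [1 if term in present else 0 for term in vocab]


def macro_f1(y_true, y_pred, labels):
    """Unweighted mean of per-class F1, so every class counts equally."""
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy reviews, invented for illustration only.
docs = [
    "The food was great",
    "Great food, great service!",
    "The food was cold and the service was slow",
]
vocab = build_vocab(docs, min_df=2)
features = binary_bow(docs[1], vocab)

# Toy labels and predictions, also invented.
labels = ["negative", "neutral", "positive"]
y_true = ["negative", "neutral", "positive", "positive"]
y_pred = ["negative", "positive", "positive", "positive"]
score = macro_f1(y_true, y_pred, labels)
```

In this toy run, "great" occurs twice in the second review but its feature value is still 1, which is what distinguishes the binary representation from raw counts. Because macro F1 averages per-class scores without weighting by class size, the completely missed neutral class pulls the score down sharply. In practice, scikit-learn's `CountVectorizer` (with its `binary`, `ngram_range`, and `min_df` parameters) and `f1_score(..., average="macro")` provide equivalent functionality.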
