Using GAN-based models to sentimental analysis on imbalanced datasets in education domain

08/26/2021
by   Ru Yang, et al.
0

While the whole world is still struggling with the COVID-19 pandemic, online learning and home office become more common. Many schools transfer their courses teaching to the online classroom. Therefore, it is significant to mine the students' feedback and opinions from their reviews towards studies so that both schools and teachers can know where they need to improve. This paper trains machine learning and deep learning models using both balanced and imbalanced datasets for sentiment classification. Two SOTA category-aware text generation GAN models: CatGAN and SentiGAN, are utilized to synthesize text used to balance the highly imbalanced dataset. Results on three datasets with different imbalance degree from distinct domains show that when using generated text to balance the dataset, the F1-score of machine learning and deep learning model on sentiment classification increases 2.79 indicate that the average growth degree for CR100k is higher than CR23k, the average growth degree for deep learning is more increased than machine learning algorithms, and the average growth degree for more complex deep learning models is more increased than simpler deep learning models in experiments.

READ FULL TEXT
research
04/15/2020

Sentiment Analysis of Yelp Reviews: A Comparison of Techniques and Models

We use over 350,000 Yelp reviews on 5,000 restaurants to perform an abla...
research
08/13/2017

Leveraging Sparse and Dense Feature Combinations for Sentiment Classification

Neural networks are one of the most popular approaches for many natural ...
research
01/27/2020

Performance Analysis and Comparison of Machine and Deep Learning Algorithms for IoT Data Classification

In recent years, the growth of Internet of Things (IoT) as an emerging t...
research
04/23/2023

Dependence of Physiochemical Features on Marine Chlorophyll Analysis with Learning Techniques

Marine chlorophyll which is present within phytoplankton are the basis o...
research
01/20/2022

Sentiment Analysis: Predicting Yelp Scores

In this work, we predict the sentiment of restaurant reviews based on a ...
research
05/06/2021

Tackling Imbalanced Data in Cybersecurity with Transfer Learning: A Case with ROP Payload Detection

In recent years, deep learning gained proliferating popularity in the cy...
research
05/05/2021

DeepSMOTE: Fusing Deep Learning and SMOTE for Imbalanced Data

Despite over two decades of progress, imbalanced data is still considere...

Please sign up or login with your details

Forgot password? Click here to reset