Data augmentation for low resource sentiment analysis using generative adversarial networks

02/18/2019
by Rahul Gupta, et al.

Sentiment analysis can suffer from a lack of data, because its datasets are often generated and annotated by humans. Where data is inadequate for training discriminative models, generative models may aid training via data augmentation. Generative Adversarial Networks (GANs) are one such class of models and have advanced the state of the art in several tasks, including image and text generation. In this paper, I train GAN models on low-resource datasets and then use them for data augmentation, with the aim of improving sentiment classifier generalization. Given the constraint of limited data, I explore various techniques for training the GAN models. I also analyze how the quality of the GAN-generated data changes as more training data is made available to the GAN. In this analysis, the generated data is evaluated in two ways: as a test set for a model trained on real data points, and as a training set for classification models. Finally, I conduct a visual analysis by projecting the generated and the real data into a two-dimensional space using the t-Distributed Stochastic Neighbor Embedding (t-SNE) method.
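The two-way evaluation described in the abstract can be illustrated with a small sketch. Everything below is an assumption made for illustration and is not taken from the paper: the features are toy 2-D Gaussian points rather than sentence representations, the "generated" set is a slightly shifted copy of the real distribution standing in for GAN output, the classifier is a simple nearest-centroid model, and the helper names (`centroid_fit`, `toy_samples`, and so on) are hypothetical. The abstract does not say which data the classifier trained on generated samples is tested on; scoring it on real data is also an assumption here.

```python
import random

def centroid_fit(X, y):
    """Fit a nearest-centroid classifier: one mean vector per label."""
    sums, counts = {}, {}
    for x, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
        counts[label] = counts.get(label, 0) + 1
    return {label: [v / counts[label] for v in acc] for label, acc in sums.items()}

def centroid_predict(model, x):
    """Return the label whose centroid is closest to x (squared Euclidean)."""
    def sqdist(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(model, key=lambda label: sqdist(model[label]))

def accuracy(model, X, y):
    """Fraction of points in (X, y) that the model labels correctly."""
    correct = sum(centroid_predict(model, x) == label for x, label in zip(X, y))
    return correct / len(y)

def toy_samples(rng, n, shift):
    """Toy 2-D features: label 1 near (+2, +2), label 0 near (-2, -2).
    `shift` moves both clusters, mimicking a generator that is slightly off."""
    X, y = [], []
    for i in range(n):
        label = i % 2
        center = 2.0 if label == 1 else -2.0
        X.append([rng.gauss(center + shift, 1.0), rng.gauss(center + shift, 1.0)])
        y.append(label)
    return X, y

rng = random.Random(0)
real_X, real_y = toy_samples(rng, 200, shift=0.0)  # stand-in for real data
gen_X, gen_y = toy_samples(rng, 200, shift=0.3)    # stand-in for GAN output

# Direction 1: generated data as a test set for a model trained on real data.
real_model = centroid_fit(real_X, real_y)
acc_gen_as_test = accuracy(real_model, gen_X, gen_y)

# Direction 2: generated data as a training set, scored on real data (assumed).
gen_model = centroid_fit(gen_X, gen_y)
acc_gen_as_train = accuracy(gen_model, real_X, real_y)

print(f"generated as test set:     {acc_gen_as_test:.3f}")
print(f"generated as training set: {acc_gen_as_train:.3f}")
```

Read together, the two scores give complementary signals: the first indicates whether generated samples look like their intended class to a model that has only seen real data, and the second indicates whether the generated samples carry enough class information to train a usable classifier.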


research
05/13/2022

Improving Astronomical Time-series Classification via Data Augmentation with Generative Adversarial Networks

Due to the latest advances in technology, telescopes with significant sk...
research
04/18/2023

TTIDA: Controllable Generative Data Augmentation via Text-to-Text and Text-to-Image Models

Data augmentation has been established as an efficacious approach to sup...
research
12/09/2020

Generative Adversarial Networks for Annotated Data Augmentation in Data Sparse NLU

Data sparsity is one of the key challenges associated with model develop...
research
04/19/2019

Data Augmentation Using GANs

In this paper we propose the use of Generative Adversarial Networks (GAN...
research
05/03/2022

Assessing Dataset Bias in Computer Vision

A biased dataset is a dataset that generally has attributes with an unev...
research
10/26/2022

Modeling the Graphotactics of Low-Resource Languages Using Sequential GANs

Generative Adversarial Networks (GANs) have been shown to aid in the cre...
research
04/29/2023

LD-GAN: Low-Dimensional Generative Adversarial Network for Spectral Image Generation with Variance Regularization

Deep learning methods are state-of-the-art for spectral image (SI) compu...
