Transformers as Neural Augmentors: Class Conditional Sentence Generation via Variational Bayes

05/19/2022
by M. Şafak Bilici, et al.

Data augmentation methods for Natural Language Processing tasks have been explored in recent years; however, they remain limited, and it is hard to capture diversity at the sentence level. Moreover, it is not always possible to perform data augmentation on supervised tasks. To address these problems, we propose a neural data augmentation method that combines a Conditional Variational Autoencoder with an encoder-decoder Transformer model. While encoding and decoding the input sentence, our model captures the syntactic and semantic representation of the input language together with its class condition. Following recent developments in pre-trained language models, we train and evaluate our models on several benchmarks to strengthen the downstream tasks. We compare our method with three different augmentation techniques. The results show that our model improves the performance of current models compared to other data augmentation techniques while requiring only a small amount of computation power.
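
The abstract does not spell out the model internals, so the following is only a minimal sketch of the two generic ingredients a Conditional Variational Autoencoder usually relies on: sampling a latent vector with the reparameterization trick, and conditioning the decoder on the class label. It assumes a diagonal Gaussian posterior, a standard normal prior, and conditioning by concatenating a one-hot label; these choices, and all function names (`one_hot`, `reparameterize`, `kl_to_standard_normal`, `decoder_input`), are illustrative assumptions and not taken from the paper. The Transformer encoder and decoder themselves are omitted.

```python
import math
import random

def one_hot(label, num_classes):
    """Encode a class index as a one-hot list (hypothetical conditioning scheme)."""
    return [1.0 if i == label else 0.0 for i in range(num_classes)]

def reparameterize(mu, logvar, rng):
    """Sample z = mu + sigma * eps with eps ~ N(0, 1), independently per dimension."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0) for m, lv in zip(mu, logvar)]

def kl_to_standard_normal(mu, logvar):
    """KL( N(mu, diag(exp(logvar))) || N(0, I) ), summed over latent dimensions."""
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv for m, lv in zip(mu, logvar))

def decoder_input(z, label, num_classes):
    """Condition the decoder by concatenating the latent vector with the class one-hot."""
    return z + one_hot(label, num_classes)

# Toy posterior parameters, standing in for what an encoder would output.
rng = random.Random(0)
mu, logvar = [0.5, -0.2], [0.0, -1.0]

z = reparameterize(mu, logvar, rng)               # latent sample
cond = decoder_input(z, label=1, num_classes=3)   # latent plus class condition
kl = kl_to_standard_normal(mu, logvar)            # regularization term of the ELBO
```

In a setup like this, generating a new sentence for a given class would amount to drawing `z` from the prior, attaching the desired label, and decoding; the KL term is what keeps the posterior close enough to the prior for that sampling to be meaningful.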

research
06/09/2020

Unsupervised Paraphrase Generation using Pre-trained Language Models

Large scale Pre-trained Language Models have proven to be very powerful ...
research
04/06/2022

DAGAM: Data Augmentation with Generation And Modification

Text classification is a representative downstream task of natural langu...
research
06/15/2022

BaIT: Barometer for Information Trustworthiness

This paper presents a new approach to the FNC-1 fake news classification...
research
06/13/2023

Rethink the Effectiveness of Text Data Augmentation: An Empirical Analysis

In recent years, language models (LMs) have made remarkable progress in ...
research
10/07/2022

UU-Tax at SemEval-2022 Task 3: Improving the generalizability of language models for taxonomy classification through data augmentation

This paper presents our strategy to address the SemEval-2022 Task 3 PreT...
research
12/16/2022

Multi-Scales Data Augmentation Approach In Natural Language Inference For Artifacts Mitigation And Pre-Trained Model Optimization

Machine learning models can reach high performance on benchmark natural ...
research
03/15/2022

Adversarial Counterfactual Augmentation: Application in Alzheimer's Disease Classification

Data augmentation has been widely used in deep learning to reduce over-f...
