Unsupervised Paraphrase Generation using Pre-trained Language Models

06/09/2020
by   Chaitra Hegde, et al.
0

Large scale Pre-trained Language Models have proven to be very powerful approach in various Natural language tasks. OpenAI's GPT-2 <cit.> is notable for its capability to generate fluent, well formulated, grammatically consistent text and for phrase completions. In this paper we leverage this generation capability of GPT-2 to generate paraphrases without any supervision from labelled data. We examine how the results compare with other supervised and unsupervised approaches and the effect of using paraphrases for data augmentation on downstream tasks such as classification. Our experiments show that paraphrases generated with our model are of good quality, are diverse and improves the downstream task performance when used for data augmentation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/06/2022

DAGAM: Data Augmentation with Generation And Modification

Text classification is a representative downstream task of natural langu...
research
02/05/2023

Exploring Data Augmentation for Code Generation Tasks

Advances in natural language processing, such as transfer learning from ...
research
05/19/2022

Transformers as Neural Augmentors: Class Conditional Sentence Generation via Variational Bayes

Data augmentation methods for Natural Language Processing tasks are expl...
research
04/01/2020

Deep Entity Matching with Pre-Trained Language Models

We present Ditto, a novel entity matching system based on pre-trained Tr...
research
10/10/2022

Metaphorical Paraphrase Generation: Feeding Metaphorical Language Models with Literal Texts

This study presents a new approach to metaphorical paraphrase generation...
research
03/13/2023

Architext: Language-Driven Generative Architecture Design

Architectural design is a highly complex practice that involves a wide d...
research
02/14/2023

BLIAM: Literature-based Data Synthesis for Synergistic Drug Combination Prediction

Language models pre-trained on scientific literature corpora have substa...

Please sign up or login with your details

Forgot password? Click here to reset