Quality Controlled Paraphrase Generation

03/21/2022
by Elron Bandel, et al.

Paraphrase generation has been widely used in various downstream tasks. Most of these tasks benefit mainly from high-quality paraphrases, namely those that are semantically similar to, yet linguistically diverse from, the original sentence. Generating high-quality paraphrases is challenging because preserving meaning becomes increasingly hard as linguistic diversity increases. Recent work achieves good results by controlling specific aspects of the paraphrase, such as its syntactic tree. However, these approaches do not allow direct control over the quality of the generated paraphrase, and they suffer from low flexibility and scalability. Here we propose QCPG, a quality-guided controlled paraphrase generation model that allows direct control of the quality dimensions. Furthermore, we suggest a method that, given a sentence, identifies points in the quality control space that are expected to yield optimal generated paraphrases. We show that our method is able to generate paraphrases that maintain the original meaning while achieving higher diversity than the uncontrolled baseline. The models, the code, and the data can be found at https://github.com/IBM/quality-controlled-paraphrase-generation.


