Learning to Simplify with Data Hopelessly Out of Alignment

04/02/2022
by Tadashi Nomoto, et al.

We consider whether it is possible to do text simplification without relying on a "parallel" corpus, that is, one made up of sentence-by-sentence alignments of complex sentences and their ground-truth simple counterparts. To this end, we introduce a number of concepts, some new and some not, including what we call Conjoined Twin Networks, Flip-Flop Auto-Encoders (FFA) and Adversarial Networks (GAN). We compare Jensen-Shannon GAN (JS-GAN) and Wasserstein GAN to see how they affect performance, and find stronger results for the former. In an experiment on a large dataset derived from Wikipedia, Twin Networks equipped with FFA and JS-GAN clearly outperformed the current best-performing system. Furthermore, we discuss where we stand in relation to fully supervised methods in the past literature, and highlight with examples the qualitative differences among simplified sentences generated by supervision-free systems.
