Synthetic Text Generation using Hypergraph Representations

09/06/2023
by   Natraj Raman, et al.
0

Generating synthetic variants of a document is often posed as text-to-text transformation. We propose an alternate LLM based method that first decomposes a document into semantic frames and then generates text using this interim sparse format. The frames are modeled using a hypergraph, which allows perturbing the frame contents in a principled manner. Specifically, new hyperedges are mined through topological analysis and complex polyadic relationships including hierarchy and temporal dynamics are accommodated. We show that our solution generates documents that are diverse, coherent and vary in style, sentiment, format, composition and facts.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/02/2020

Citation Text Generation

We introduce the task of citation text generation: given a pair of scien...
research
11/15/2018

Automatic Text Document Summarization using Semantic-based Analysis

Since the advent of the web, the amount of data on wen has been increase...
research
03/28/2023

Synthetically generated text for supervised text analysis

Supervised text models are a valuable tool for political scientists but ...
research
05/31/2017

Does the Geometry of Word Embeddings Help Document Classification? A Case Study on Persistent Homology Based Representations

We investigate the pertinence of methods from algebraic topology for tex...
research
09/06/2021

STRIVE: Scene Text Replacement In Videos

We propose replacing scene text in videos using deep style transfer and ...
research
03/13/2021

Lightweight Selective Disclosure for Verifiable Documents on Blockchain

To achieve lightweight selective disclosure for protecting privacy of do...

Please sign up or login with your details

Forgot password? Click here to reset