CycleGT: Unsupervised Graph-to-Text and Text-to-Graph Generation via Cycle Training

06/08/2020
by   Qipeng Guo, et al.
10

Two important tasks at the intersection of knowledge graphs and natural language processing are graph-to-text (G2T) and text-to-graph (T2G) conversion. Due to the difficulty and high cost of data collection, the supervised data available in the two fields are usually on the magnitude of tens of thousands, for example, 18K in the WebNLG dataset, which is far fewer than the millions of data for other tasks such as machine translation. Consequently, deep learning models in these two fields suffer largely from scarce training data. This work presents the first attempt to unsupervised learning of T2G and G2T via cycle training. We present CycleGT, an unsupervised training framework that can bootstrap from fully non-parallel graph and text datasets, iteratively back translate between the two forms, and use a novel pretraining strategy. Experiments on the benchmark WebNLG dataset show that, impressively, our unsupervised model trained on the same amount of data can achieve performance on par with the supervised models. This validates our framework as an effective approach to overcome the data scarcity problem in the fields of G2T and T2G.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/12/2022

A multi-task semi-supervised framework for Text2Graph Graph2Text

The Artificial Intelligence industry regularly develops applications tha...
research
09/22/2022

INFINITY: A Simple Yet Effective Unsupervised Framework for Graph-Text Mutual Conversion

Graph-to-text (G2T) generation and text-to-graph (T2G) triple extraction...
research
04/20/2019

Unsupervised Text Generation from Structured Data

This work presents a joint solution to two challenging tasks: text gener...
research
05/21/2023

PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMs

Large language models (LLMs) have shown great abilities of solving vario...
research
03/01/2018

Matching Natural Language Sentences with Hierarchical Sentence Factorization

Semantic matching of natural language sentences or identifying the relat...
research
10/29/2018

Unsupervised Data Selection for Supervised Learning

Recent research put a big effort in the development of deep learning arc...
research
06/07/2021

Unsupervised Representation Disentanglement of Text: An Evaluation on Synthetic Datasets

To highlight the challenges of achieving representation disentanglement ...

Please sign up or login with your details

Forgot password? Click here to reset