Neural Pipeline for Zero-Shot Data-to-Text Generation

03/30/2022
by   Zdeněk Kasner, et al.
0

In data-to-text (D2T) generation, training on in-domain data leads to overfitting to the data representation and repeating training data noise. We examine how to avoid finetuning pretrained language models (PLMs) on D2T generation datasets while still taking advantage of surface realization capabilities of PLMs. Inspired by pipeline approaches, we propose to generate text by transforming single-item descriptions with a sequence of modules trained on general-domain text-based operations: ordering, aggregation, and paragraph compression. We train PLMs for performing these operations on a synthetic corpus WikiFluent which we build from English Wikipedia. Our experiments on two major triple-to-text datasets – WebNLG and E2E – show that our approach enables D2T generation from RDF triples in zero-shot settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/22/2020

Few-Shot Text Generation with Pattern-Exploiting Training

Providing pretrained language models with simple task descriptions or pr...
research
10/09/2022

ASDOT: Any-Shot Data-to-Text Generation with Pretrained Language Models

Data-to-text generation is challenging due to the great variety of the i...
research
11/28/2022

Action-GPT: Leveraging Large-scale Language Models for Improved and Generalized Action Generation

We introduce Action-GPT, a plug-and-play framework for incorporating Lar...
research
07/14/2023

Using Large Language Models for Zero-Shot Natural Language Generation from Knowledge Graphs

In any system that uses structured knowledge graph (KG) data as its unde...
research
02/24/2021

Zero-Shot Text-to-Image Generation

Text-to-image generation has traditionally focused on finding better mod...
research
06/07/2023

Increasing Diversity While Maintaining Accuracy: Text Data Generation with Large Language Models and Human Interventions

Large language models (LLMs) can be used to generate text data for train...
research
10/20/2020

AutoMeTS: The Autocomplete for Medical Text Simplification

The goal of text simplification (TS) is to transform difficult text into...

Please sign up or login with your details

Forgot password? Click here to reset