Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity

04/08/2020
by   Hamza Harkous, et al.
0

End-to-end neural data-to-text (D2T) generation has recently emerged as an alternative to pipeline-based architectures. However, it has faced challenges in generalizing to new domains and generating semantically consistent text. In this work, we present DataTuner, a neural, end-to-end data-to-text generation system that makes minimal assumptions about the data representation and the target domain. We take a two-stage generation-reranking approach, combining a fine-tuned language model with a semantic fidelity classifier. Each of our components is learnt end-to-end without the need for dataset-specific heuristics, entity delexicalization, or post-processing. We show that DataTuner achieves state of the art results on the automated metrics across four major D2T datasets (LDC2017T10, WebNLG, ViGGO, and Cleaned E2E), with a fluency assessed by human annotators nearing or exceeding the human-written reference texts. We further demonstrate that the model-based semantic fidelity scorer in DataTuner is a better assessment tool compared to traditional, heuristic-based measures. Our generated text has a significantly better semantic fidelity than the state of the art across all four datasets

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/23/2019

Neural data-to-text generation: A comparison between pipeline and end-to-end architectures

Traditionally, most data-to-text applications have been designed using a...
research
10/08/2022

Comparing Computational Architectures for Automated Journalism

The majority of NLG systems have been designed following either a templa...
research
09/08/2018

Operations Guided Neural Networks for High Fidelity Data-To-Text Generation

Recent neural models for data-to-text generation are mostly based on dat...
research
06/07/2019

Data-to-text Generation with Entity Modeling

Recent approaches to data-to-text generation have shown great promise th...
research
11/03/2020

Data-to-Text Generation with Iterative Text Editing

We present a novel approach to data-to-text generation based on iterativ...
research
01/12/2020

Revisiting Challenges in Data-to-Text Generation with Fact Grounding

Data-to-text generation models face challenges in ensuring data fidelity...
research
06/28/2017

The E2E Dataset: New Challenges For End-to-End Generation

This paper describes the E2E data, a new dataset for training end-to-end...

Please sign up or login with your details

Forgot password? Click here to reset