Sentence Semantic Regression for Text Generation

08/06/2021
by   Wei Wang, et al.

Classical work on text generation divides the process into two phases: idea reasoning and surface realization. Idea reasoning determines the main idea to be expressed in the upcoming speech or writing. Surface realization then composes the most appropriate sentence to convey the information distilled from that main idea. However, the currently popular token-by-token generation methods skip this two-phase process and suffer from serious issues such as idea/topic drift. To tackle these problems and realize the two-phase paradigm, we propose a new framework named Sentence Semantic Regression (SSR), based on sentence-level language modeling. For idea reasoning, two architectures, SSR-AR and SSR-NonAR, are designed to conduct sentence semantic regression autoregressively (like GPT-2/3) and bidirectionally (like BERT), respectively. For surface realization, a mixed-granularity sentence decoder generates text with better consistency by jointly incorporating the predicted sentence-level main idea and the preceding token-level context. We conduct experiments on four tasks: story ending prediction, story ending generation, dialogue generation, and sentence infilling. The results show that SSR obtains better performance in terms of both automatic metrics and human evaluation.
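To make the difference between the two idea-reasoning variants concrete, the following is a minimal sketch, not the authors' implementation. It assumes each sentence has already been encoded as a fixed-size vector, and it only shows which sentence positions a predictor may look at in each mode. The function names, the mean-pooling stand-in for a learned model, and the squared-error loss are all illustrative assumptions; the abstract does not specify the encoder, the predictor, or the training objective.

```python
from typing import List

Vector = List[float]


def visible_positions(num_sentences: int, target: int, mode: str) -> List[int]:
    """Return the sentence positions usable when regressing position `target`.

    "ar"    : autoregressive, only the preceding sentences (GPT-like).
    "nonar" : bidirectional, every sentence except the target (BERT-like).
    """
    if mode == "ar":
        return list(range(target))
    if mode == "nonar":
        return [i for i in range(num_sentences) if i != target]
    raise ValueError(f"unknown mode: {mode}")


def predict_sentence_vector(sent_vecs: List[Vector], target: int, mode: str) -> Vector:
    """Regress the target sentence vector from its visible context.

    ASSUMPTION: mean pooling is a placeholder for a learned sentence-level
    model; it is used here only so the example runs without dependencies.
    """
    context = [sent_vecs[i] for i in visible_positions(len(sent_vecs), target, mode)]
    if not context:
        raise ValueError("no visible context for this target position")
    dim = len(context[0])
    return [sum(vec[d] for vec in context) / len(context) for d in range(dim)]


def squared_error(pred: Vector, gold: Vector) -> float:
    """ASSUMPTION: squared error as a generic regression loss for illustration."""
    return sum((p - g) ** 2 for p, g in zip(pred, gold))


if __name__ == "__main__":
    # Four toy 2-dimensional "sentence vectors"; position 2 is the target.
    story = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, 2.0]]
    for mode in ("ar", "nonar"):
        pred = predict_sentence_vector(story, target=2, mode=mode)
        print(mode, visible_positions(len(story), 2, mode), pred,
              squared_error(pred, story[2]))
```

In this toy setup the autoregressive mode corresponds to tasks where only preceding text is known (such as story ending or dialogue generation), while the bidirectional mode corresponds to sentence infilling, where context on both sides of the gap is available. The surface-realization decoder, which would turn the predicted vector into tokens, is not sketched here.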


