Joint Retrieval and Generation Training for Grounded Text Generation

05/14/2021
by Yizhe Zhang, et al.

Recent advances in large-scale pre-training, such as GPT-3, allow seemingly high-quality text to be generated from a given prompt. However, such generation systems often hallucinate facts and are not inherently designed to incorporate useful external information. Grounded generation models appear to offer a remedy, but their training typically relies on rarely available parallel data in which corresponding documents are provided as context. We propose a framework that alleviates this data constraint by jointly training a grounded generator and a document retriever on the language model signal. The model learns to retrieve the documents with the highest utility for generation and attentively combines them in the output. We demonstrate that, by taking advantage of external references, our approach can produce more informative and interesting text in both prose and dialogue generation.
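
To make the idea of training a retriever "on the language model signal" concrete, here is a minimal sketch under stated assumptions. It treats the retrieved document as a latent variable and marginalizes the generator likelihood over a handful of retrieved candidates, which is one common way to set up such joint training and is not necessarily the exact objective used in the paper. The function names and the toy numbers are hypothetical; in a real system the scores would come from a retriever network and the log-likelihoods from a grounded generator.

```python
import math


def log_softmax(scores):
    """Numerically stable log-softmax over a list of floats."""
    m = max(scores)
    log_z = m + math.log(sum(math.exp(s - m) for s in scores))
    return [s - log_z for s in scores]


def marginal_log_likelihood(retriever_scores, generator_log_likelihoods):
    """log sum_z p(z | x) * p(y | x, z) over the retrieved documents z.

    retriever_scores: unnormalized relevance score per document (assumed).
    generator_log_likelihoods: log p(y | x, z) per document (assumed).
    """
    log_prior = log_softmax(retriever_scores)
    joint = [lp + ll for lp, ll in zip(log_prior, generator_log_likelihoods)]
    m = max(joint)
    return m + math.log(sum(math.exp(j - m) for j in joint))


def retriever_score_gradient(retriever_scores, generator_log_likelihoods):
    """Gradient of the marginal log-likelihood w.r.t. each retriever score.

    Works out to posterior(z | x, y) - prior(z | x): documents that make the
    target text more likely than the retriever expected get pushed up.
    """
    log_prior = log_softmax(retriever_scores)
    joint = [lp + ll for lp, ll in zip(log_prior, generator_log_likelihoods)]
    posterior = [math.exp(v) for v in log_softmax(joint)]
    prior = [math.exp(v) for v in log_prior]
    return [q - p for q, p in zip(posterior, prior)]


if __name__ == "__main__":
    # Toy case: three documents the retriever currently rates equally,
    # but the generator finds the first one far more useful.
    scores = [0.0, 0.0, 0.0]
    log_liks = [-1.0, -5.0, -5.0]
    print(marginal_log_likelihood(scores, log_liks))
    print(retriever_score_gradient(scores, log_liks))
```

In this toy case the gradient is positive for the first document and negative for the other two, so a gradient-ascent step would raise the score of the document with the highest utility for generation. No document-level labels are involved, which is how a setup of this kind sidesteps the need for parallel grounded data.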

research
04/26/2021

Focused Attention Improves Document-Grounded Generation

Document grounded generation is the task of using the information provid...
research
10/14/2021

Hindsight: Posterior-guided training of retrievers for improved open-ended generation

Many text generation systems benefit from using a retriever to retrieve ...
research
08/01/2023

ZRIGF: An Innovative Multimodal Framework for Zero-Resource Image-Grounded Dialogue Generation

Image-grounded dialogue systems benefit greatly from integrating visual ...
research
11/03/2022

Eliciting Knowledge from Large Pre-Trained Models for Unsupervised Knowledge-Grounded Conversation

Recent advances in large-scale pre-training provide large models with th...
research
02/23/2023

Coarse-to-Fine Knowledge Selection for Document Grounded Dialogs

Multi-document grounded dialogue systems (DGDS) belong to a class of con...
research
05/05/2023

Expository Text Generation: Imitate, Retrieve, Paraphrase

Expository documents are vital resources for conveying complex informati...
research
05/17/2020

MixingBoard: a Knowledgeable Stylized Integrated Text Generation Platform

We present MixingBoard, a platform for quickly building demos with a foc...
