A Dataset for Document Grounded Conversations

09/19/2018
by Kangyan Zhou, et al.

This paper introduces a document grounded dataset for text conversations. We define "Document Grounded Conversations" as conversations that are about the contents of a specified document. In this dataset, the specified documents are Wikipedia articles about popular movies. The dataset contains 4112 conversations with an average of 21.43 turns per conversation. As a result, the dataset provides not only a relevant chat history for generating responses but also a source of information that models can draw on. We describe two neural architectures that provide benchmark performance on the task of generating the next response. We also evaluate our models for engagement and fluency, and find that the information from the document helps in generating more engaging and fluent responses.


