The StatCan Dialogue Dataset: Retrieving Data Tables through Conversations with Genuine Intents

04/03/2023
by   Xing Han Lu, et al.
0

We introduce the StatCan Dialogue Dataset consisting of 19,379 conversation turns between agents working at Statistics Canada and online users looking for published data tables. The conversations stem from genuine intents, are held in English or French, and lead to agents retrieving one of over 5000 complex data tables. Based on this dataset, we propose two tasks: (1) automatic retrieval of relevant tables based on a on-going conversation, and (2) automatic generation of appropriate agent responses at each turn. We investigate the difficulty of each task by establishing strong baselines. Our experiments on a temporal data split reveal that all models struggle to generalize to future conversations, as we observe a significant drop in performance across both tasks when we move from the validation to the test set. In addition, we find that response generation models struggle to decide when to return a table. Considering that the tasks pose significant challenges to existing models, we encourage the community to develop models for our task, which can be directly used to help knowledge workers find relevant tables for live chat users.

READ FULL TEXT

page 20

page 21

research
04/28/2022

HybriDialogue: An Information-Seeking Dialogue Dataset Grounded on Tabular and Textual Data

A pressing challenge in current dialogue systems is to successfully conv...
research
04/06/2023

Pragmatically Appropriate Diversity for Dialogue Evaluation

Linguistic pragmatics state that a conversation's underlying speech acts...
research
01/20/2019

Dialogue Design and Management for Multi-Session Casual Conversation with Older Adults

We address the problem of designing a conversational avatar capable of a...
research
05/19/2022

Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation

Target-guided response generation enables dialogue systems to smoothly t...
research
01/16/2019

Learning from Dialogue after Deployment: Feed Yourself, Chatbot!

The majority of conversations a dialogue agent sees over its lifetime oc...
research
11/02/2018

Neural Response Ranking for Social Conversation: A Data-Efficient Approach

The overall objective of 'social' dialogue systems is to support engagin...
research
07/13/2023

DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations

Applications that could benefit from automatic understanding of human-hu...

Please sign up or login with your details

Forgot password? Click here to reset