ArgSciChat: A Dataset for Argumentative Dialogues on Scientific Papers

02/14/2022
by   Federico Ruggeri, et al.

The applications of conversational agents for scientific disciplines (as expert domains) are understudied due to the lack of dialogue data to train such agents. While most data collection frameworks, such as Amazon Mechanical Turk, foster data collection for generic domains by connecting crowd workers and task designers, these frameworks are not well optimized for data collection in expert domains. Scientists are rarely present on these frameworks because of their limited time. Therefore, we introduce a novel framework to collect dialogues between scientists, as domain experts, on scientific papers. Our framework lets scientists present their scientific papers as grounding for dialogues and participate in dialogues on papers whose titles interest them. We use our framework to collect a novel argumentative dialogue dataset, ArgSciChat. It consists of 498 messages collected from 41 dialogues on 20 scientific papers. Alongside an extensive analysis of ArgSciChat, we evaluate a recent conversational agent on our dataset. Experimental results show that this agent performs poorly on ArgSciChat, motivating further research on argumentative scientific agents. We release our framework and the dataset.

research
05/01/2017

MACA: A Modular Architecture for Conversational Agents

We propose a software architecture designed to ease the implementation o...
research
12/07/2020

The Lab vs The Crowd: An Investigation into Data Quality for Neural Dialogue Models

Challenges around collecting and processing quality data have hampered p...
research
01/15/2018

Building a Conversational Agent Overnight with Dialogue Self-Play

We propose Machines Talking To Machines (M2M), a framework combining aut...
research
08/30/2021

Semi-Supervised Exaggeration Detection of Health Science Press Releases

Public trust in science depends on honest and factual communication of s...
research
01/10/2018

Exploring Stereotypes and Biased Data with the Crowd

The goal of our research is to contribute information about how useful t...
research
02/21/2017

Learning to Generate Posters of Scientific Papers by Probabilistic Graphical Models

Researchers often summarize their work in the form of scientific posters...
research
04/05/2016

Learning to Generate Posters of Scientific Papers

Researchers often summarize their work in the form of posters. Posters p...
