A Repository of Conversational Datasets

04/13/2019
by   Matthew Henderson, et al.

Progress in machine learning is often driven by the availability of large datasets and of consistent evaluation metrics for comparing modeling approaches. To this end, we present a repository of conversational datasets consisting of hundreds of millions of examples, and a standardised evaluation procedure for conversational response selection models using '1-of-100 accuracy'. The repository contains scripts that allow researchers to reproduce the standard datasets, or to adapt the pre-processing and data filtering steps to their needs. We introduce and evaluate several competitive baselines for conversational response selection, whose implementations are shared in the repository, as well as a neural encoder model trained on the entire training set.
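The abstract names '1-of-100 accuracy' without defining it. The sketch below shows one plausible reading, assuming the metric scores each context against the 100 responses in its batch (its own response plus 99 taken from the other examples) and counts how often the true response ranks first. The function names, the word-overlap scorer, and the handling of ties are illustrative assumptions, not the repository's implementation.

```python
def one_of_n_accuracy(scores):
    """Fraction of contexts whose true response gets the top score.

    Assumption: scores[i][j] is the model score for context i paired with
    response j, and the true response for context i sits at index i.
    Ties are counted as errors (an illustrative choice).
    """
    n = len(scores)
    correct = 0
    for i, row in enumerate(scores):
        if all(row[i] > row[j] for j in range(n) if j != i):
            correct += 1
    return correct / n


def evaluate_in_batches(pairs, score_fn, batch_size=100):
    """Average 1-of-N accuracy over consecutive full batches.

    pairs is a list of (context, response) tuples; the responses of the
    other examples in a batch act as the negative candidates.
    """
    accuracies = []
    for start in range(0, len(pairs) - batch_size + 1, batch_size):
        batch = pairs[start:start + batch_size]
        scores = [[score_fn(context, response) for _, response in batch]
                  for context, _ in batch]
        accuracies.append(one_of_n_accuracy(scores))
    if not accuracies:
        raise ValueError("need at least one full batch of examples")
    return sum(accuracies) / len(accuracies)


def word_overlap(context, response):
    """Hypothetical toy scorer: number of distinct words shared."""
    return len(set(context.lower().split()) & set(response.lower().split()))
```

With batch_size=100 this corresponds to the 1-of-100 setting; a smaller batch size is convenient for trying the functions on a handful of examples. Any scoring function with the same (context, response) signature can be substituted for word_overlap.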

research
01/19/2020

Common Conversational Community Prototype: Scholarly Conversational Assistant

This paper discusses the potential for creating academic resources (tool...
research
05/14/2019

Improving Neural Conversational Models with Entropy-Based Data Filtering

Current neural-network based conversational models lack diversity and ge...
research
10/24/2020

An Evaluation Protocol for Generative Conversational Systems

There is a multitude of novel generative models for open-domain conversa...
research
08/12/2018

Addressee and Response Selection for Multilingual Conversation

Developing conversational systems that can converse in many languages is...
research
10/07/2022

Unsupervised Neural Stylistic Text Generation using Transfer learning and Adapters

Research has shown that personality is a key driver to improve engagemen...
research
09/10/2019

A Crowd-based Evaluation of Abuse Response Strategies in Conversational Agents

How should conversational agents respond to verbal abuse through the use...
research
06/08/2015

ASlib: A Benchmark Library for Algorithm Selection

The task of algorithm selection involves choosing an algorithm from a se...
