Pchatbot: A Large-Scale Dataset for Personalized Chatbot

09/28/2020
by Xiaohe Li, et al.

Natural language dialogue systems have attracted great attention recently. As many dialogue models are data-driven, high-quality datasets are essential to these systems. In this paper, we introduce Pchatbot, a large-scale dialogue dataset that contains two subsets collected from Weibo and judicial forums, respectively. Unlike existing datasets, which contain only post-response pairs, Pchatbot includes anonymized user IDs and timestamps. This enables the development of personalized dialogue models, which depend on the availability of users' historical conversations. Furthermore, Pchatbot is significantly larger than existing datasets, which may benefit data-driven models. Our preliminary experimental study shows that a personalized chatbot model trained on Pchatbot outperforms the corresponding ad-hoc chatbot models. We also demonstrate that using a larger dataset improves the quality of dialogue models.
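To illustrate why user IDs and timestamps matter, the following is a minimal sketch of how a per-user conversation history could be assembled from such records. The `Record` fields and both helper functions are hypothetical names chosen for illustration; the abstract does not specify the actual Pchatbot schema or any preprocessing code.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    # Hypothetical field names; the real Pchatbot schema may differ.
    post: str
    response: str
    user_id: str    # anonymized ID of the responding user
    timestamp: int  # assumed to be an integer such as Unix seconds


def build_histories(records):
    """Group records by responding user, each list ordered by timestamp."""
    histories = defaultdict(list)
    for rec in sorted(records, key=lambda r: r.timestamp):
        histories[rec.user_id].append(rec)
    return dict(histories)


def history_before(histories, user_id, timestamp):
    """Return the user's records strictly earlier than `timestamp`."""
    return [r for r in histories.get(user_id, []) if r.timestamp < timestamp]


records = [
    Record("How is the weather?", "Sunny here.", "u1", 30),
    Record("Any lunch ideas?", "Noodles.", "u1", 10),
    Record("Any lunch ideas?", "Rice.", "u2", 20),
]
histories = build_histories(records)
# Only u1's earlier response is available as history at time 30.
past = history_before(histories, "u1", 30)
```

Filtering on the timestamp keeps a model from seeing a user's later responses when it generates an earlier one, which is one plausible reason to record both fields. A dataset with only post-response pairs would support neither the grouping nor the ordering.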


research
04/18/2022

Less is More: Learning to Refine Dialogue History for Personalized Dialogue Generation

Personalized dialogue systems explore the problem of generating response...
research
08/10/2020

A Large-Scale Chinese Short-Text Conversation Dataset

The advancements of neural dialogue generation models show promising res...
research
12/12/2021

ValueNet: A New Dataset for Human Value Driven Dialogue System

Building a socially intelligent agent involves many challenges, one of w...
research
09/20/2020

Dialogue Distillation: Open-domain Dialogue Augmentation Using Unpaired Data

Recent advances in open-domain dialogue systems rely on the success of n...
research
04/29/2022

"My nose is running.""Are you also coughing?": Building A Medical Diagnosis Agent with Interpretable Inquiry Logics

With the rise of telemedicine, the task of developing Dialogue Systems f...
research
06/16/2019

Persuasion for Good: Towards a Personalized Persuasive Dialogue System for Social Good

Developing intelligent persuasive conversational agents to change people...
research
01/28/2019

Personalized Dialogue Generation with Diversified Traits

Endowing a dialogue system with particular personality traits is essenti...
