DCH-2: A Parallel Customer-Helpdesk Dialogue Corpus with Distributions of Annotators' Labels

04/18/2021
by   Zhaohao Zeng, et al.
0

We introduce a data set called DCH-2, which contains 4,390 real customer-helpdesk dialogues in Chinese and their English translations. DCH-2 also contains dialogue-level annotations and turn-level annotations obtained independently from either 19 or 20 annotators. The data set was built through our effort as organisers of the NTCIR-14 Short Text Conversation and NTCIR-15 Dialogue Evaluation tasks, to help researchers understand what constitutes an effective customer-helpdesk dialogue, and thereby build efficient and helpful helpdesk systems that are available to customers at all times. In addition, DCH-2 may be utilised for other purposes, for example, as a repository for retrieval-based dialogue systems, or as a parallel corpus for machine translation in the helpdesk domain.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

research
09/27/2021

The JDDC 2.0 Corpus: A Large-Scale Multimodal Multi-Turn Chinese Dialogue Dataset for E-commerce Customer Service

With the development of the Internet, more and more people get accustome...
research
11/17/2020

KddRES: A Multi-level Knowledge-driven Dialogue Dataset for Restaurant Towards Customized Dialogue System

Compared with CrossWOZ (Chinese) and MultiWOZ (English) dataset which ha...
research
08/31/2022

Unified Knowledge Prompt Pre-training for Customer Service Dialogues

Dialogue bots have been widely applied in customer service scenarios to ...
research
08/10/2020

A Large-Scale Chinese Short-Text Conversation Dataset

The advancements of neural dialogue generation models show promising res...
research
03/03/2019

Detecting dementia in Mandarin Chinese using transfer learning from a parallel corpus

Machine learning has shown promise for automatic detection of Alzheimer'...
research
11/22/2019

The JDDC Corpus: A Large-Scale Multi-Turn Chinese Dialogue Dataset forE-commerce Customer Service

Human conversations in real scenarios are complicated and building a hum...
research
03/28/2017

A practical approach to dialogue response generation in closed domains

We describe a prototype dialogue response generation model for the custo...

Please sign up or login with your details

Forgot password? Click here to reset