MedDG: A Large-scale Medical Consultation Dataset for Building Medical Dialogue System

10/15/2020
by   Wenge Liu, et al.
0

Developing conversational agents to interact with patients and provide primary clinical advice has attracted increasing attention due to its huge application potential, especially in the time of COVID-19 Pandemic. However, the training of end-to-end neural-based medical dialogue system is restricted by an insufficient quantity of medical dialogue corpus. In this work, we make the first attempt to build and release a large-scale high-quality Medical Dialogue dataset related to 12 types of common Gastrointestinal diseases named MedDG, with more than 17K conversations collected from the online health consultation community. Five different categories of entities, including diseases, symptoms, attributes, tests, and medicines, are annotated in each conversation of MedDG as additional labels. To push forward the future research on building expert-sensitive medical dialogue system, we proposes two kinds of medical dialogue tasks based on MedDG dataset. One is the next entity prediction and the other is the doctor response generation. To acquire a clear comprehension on these two medical dialogue tasks, we implement several state-of-the-art benchmarks, as well as design two dialogue models with a further consideration on the predicted entities. Experimental results show that the pre-train language models and other baselines struggle on both tasks with poor performance in our dataset, and the response quality can be enhanced with the help of auxiliary entity information. From human evaluation, the simple retrieval model outperforms several state-of-the-art generative models, indicating that there still remains a large room for improvement on generating medically meaningful responses.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/07/2020

MedDialog: A Large-scale Medical Dialogue Dataset

Medical dialogue systems are promising in assisting in telemedicine to i...
research
08/03/2021

More but Correct: Generating Diversified and Entity-revised Medical Response

Medical Dialogue Generation (MDG) is intended to build a medical dialogu...
research
11/28/2022

Automatically Extracting Information in Medical Dialogue: Expert System And Attention for Labelling

Medical dialogue information extraction is becoming an increasingly sign...
research
09/01/2021

M^2-MedDialog: A Dataset and Benchmarks for Multi-domain Multi-service Medical Dialogues

Medical dialogue systems (MDSs) aim to assist doctors and patients with ...
research
03/18/2022

Prompt-based Generative Approach towards Multi-Hierarchical Medical Dialogue State Tracking

The medical dialogue system is a promising application that can provide ...
research
05/19/2023

Plug-and-Play Medical Dialogue System

Medical dialogue systems aim to provide accurate answers to patients, ne...
research
05/11/2020

On the Generation of Medical Dialogues for COVID-19

Under the pandemic of COVID-19, people experiencing COVID19-related symp...

Please sign up or login with your details

Forgot password? Click here to reset