MedDialog: A Large-scale Medical Dialogue Dataset

by Shu Chen, et al.

Medical dialogue systems show promise for assisting telemedicine by increasing access to healthcare services, improving the quality of patient care, and reducing medical costs. To facilitate the research and development of medical dialogue systems, we build MedDialog, a large-scale medical dialogue dataset that contains 1.1 million conversations between patients and doctors and 4 million utterances. To the best of our knowledge, MedDialog is the largest medical dialogue dataset to date. The dataset is available at



A Spoken Drug Prescription Dataset in French for Spoken Language Understanding

Spoken medical dialogue systems are increasingly attracting interest to ...

M^2-MedDialog: A Dataset and Benchmarks for Multi-domain Multi-service Medical Dialogues

Medical dialogue systems (MDSs) aim to assist doctors and patients with ...

On the Generation of Medical Dialogues for COVID-19

Under the pandemic of COVID-19, people experiencing COVID19-related symp...

ChatDoctor: A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge

Recent large language models (LLMs) in the general domain, such as ChatG...

BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Datasets

To build open-domain chatbots that are able to use diverse communicative...

MedDG: A Large-scale Medical Consultation Dataset for Building Medical Dialogue System

Developing conversational agents to interact with patients and provide p...

Prompt-based Generative Approach towards Multi-Hierarchical Medical Dialogue State Tracking

The medical dialogue system is a promising application that can provide ...
