ViMQ: A Vietnamese Medical Question Dataset for Healthcare Dialogue System Development

04/27/2023
by   Ta Duc Huy, et al.
0

Existing medical text datasets usually take the form of ques- tion and answer pairs that support the task of natural language gener- ation, but lacking the composite annotations of the medical terms. In this study, we publish a Vietnamese dataset of medical questions from patients with sentence-level and entity-level annotations for the Intent Classification and Named Entity Recognition tasks. The tag sets for two tasks are in medical domain and can facilitate the development of task- oriented healthcare chatbots with better comprehension of queries from patients. We train baseline models for the two tasks and propose a simple self-supervised training strategy with span-noise modelling that substan- tially improves the performance. Dataset and code will be published at https://github.com/tadeephuy/ViMQ

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/19/2022

A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and Datasets

In recent years, interest has arisen in using machine learning to improv...
research
10/22/2017

Bringing Semantic Structures to User Intent Detection in Online Medical Queries

The Internet has revolutionized healthcare by offering medical informati...
research
01/17/2022

RuMedBench: A Russian Medical Language Understanding Benchmark

The paper describes the open Russian medical language understanding benc...
research
04/20/2022

LingYi: Medical Conversational Question Answering System based on Multi-modal Knowledge Graphs

The medical conversational system can relieve the burden of doctors and ...
research
06/29/2022

GERNERMED++: Transfer Learning in German Medical NLP

We present a statistical model for German medical natural language proce...
research
07/12/2022

OSLAT: Open Set Label Attention Transformer for Medical Entity Span Extraction

Identifying spans in medical texts that correspond to medical entities i...
research
02/17/2023

Med-EASi: Finely Annotated Dataset and Models for Controllable Simplification of Medical Texts

Automatic medical text simplification can assist providers with patient-...

Please sign up or login with your details

Forgot password? Click here to reset