Natural language understanding for task oriented dialog in the biomedical domain in a low resources context

11/23/2018
by   Antoine Neuraz, et al.

In the biomedical domain, the lack of sharable datasets often limits the development of natural language processing systems, especially dialogue applications and natural language understanding (NLU) models. To overcome this issue, we explore data generation using templates and terminologies, together with data augmentation approaches. Specifically, we report experiments using paraphrasing and word representations learned on a large EHR corpus with Fasttext and ELMo to train an NLU model without any available dataset. We evaluate on an NLU task of natural language queries in EHRs, divided into slot-filling and intent classification sub-tasks. On the slot-filling task, we obtain an F-score of 0.76 with the ELMo representation; on the classification task, we obtain a mean F-score of 0.71. Our results show that this method could be used to develop a baseline system.
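
The abstract does not list the templates or terminologies actually used, so the following is only a minimal sketch of how template-based generation of NLU training data might look. The templates, intent names, slot types, terms, and the `generate` helper are all hypothetical, and the BIO tagging scheme is an assumption rather than the authors' stated format.

```python
import itertools
import re

# Hypothetical templates: (intent label, pattern with {slot} placeholders).
TEMPLATES = [
    ("find_patients", "find patients with {condition}"),
    ("find_patients", "show me patients treated with {drug}"),
    ("count_patients", "how many patients have {condition}"),
]

# Hypothetical mini terminology: slot type -> candidate terms.
TERMINOLOGY = {
    "condition": ["diabetes", "chronic kidney disease"],
    "drug": ["metformin"],
}

PLACEHOLDER = re.compile(r"\{(\w+)\}")


def generate(templates, terminology):
    """Fill every template with every combination of terms for its slots.

    Returns a list of dicts holding the intent, the tokens, and one BIO
    tag per token (assumed labelling scheme for the slot-filling task).
    """
    examples = []
    for intent, template in templates:
        slots = PLACEHOLDER.findall(template)
        term_lists = [terminology[slot] for slot in slots]
        for terms in itertools.product(*term_lists):
            mapping = dict(zip(slots, terms))
            tokens, tags = [], []
            for word in template.split():
                match = PLACEHOLDER.fullmatch(word)
                if match:
                    slot = match.group(1)
                    term_tokens = mapping[slot].split()
                    tokens.extend(term_tokens)
                    # First token of the term gets B-, the rest get I-.
                    tags.append("B-" + slot)
                    tags.extend(["I-" + slot] * (len(term_tokens) - 1))
                else:
                    tokens.append(word)
                    tags.append("O")
            examples.append({"intent": intent, "tokens": tokens, "tags": tags})
    return examples


examples = generate(TEMPLATES, TERMINOLOGY)
for example in examples:
    print(example["intent"], list(zip(example["tokens"], example["tags"])))
```

Each generated example carries both an intent label and per-token slot tags, so a single synthetic corpus of this kind could feed both sub-tasks mentioned above; paraphrasing would then be applied on top of it as a separate augmentation step.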

research
01/20/2021

A survey of joint intent detection and slot-filling models in natural language understanding

Intent classification and slot filling are two critical tasks for natura...
research
06/10/2021

AUGNLG: Few-shot Natural Language Generation using Self-trained Data Augmentation

Natural Language Generation (NLG) is a key component in a task-oriented ...
research
09/15/2017

Harvesting Creative Templates for Generating Stylistically Varied Restaurant Reviews

Many of the creative and figurative elements that make language exciting...
research
11/01/2020

Recent Neural Methods on Slot Filling and Intent Classification for Task-Oriented Dialogue Systems: A Survey

In recent years, fostered by deep learning technologies and by the high ...
research
06/12/2020

A Generative Model for Joint Natural Language Understanding and Generation

Natural language understanding (NLU) and natural language generation (NL...
research
04/27/2022

NLU++: A Multi-Label, Slot-Rich, Generalisable Dataset for Natural Language Understanding in Task-Oriented Dialogue

We present NLU++, a novel dataset for natural language understanding (NL...
research
03/28/2019

A dataset for resolving referring expressions in spoken dialogue via contextual query rewrites (CQR)

We present Contextual Query Rewrite (CQR) a dataset for multi-domain tas...
