HealthE: Classifying Entities in Online Textual Health Advice

10/06/2022
by   Joseph Gatto, et al.
0

The processing of entities in natural language is essential to many medical NLP systems. Unfortunately, existing datasets vastly under-represent the entities required to model public health relevant texts such as health advice often found on sites like WebMD. People rely on such information for personal health management and clinically relevant decision making. In this work, we release a new annotated dataset, HealthE, consisting of 6,756 health advice. HealthE has a more granular label space compared to existing medical NER corpora and contains annotation for diverse health phrases. Additionally, we introduce a new health entity classification model, EP S-BERT, which leverages textual context patterns in the classification of entity classes. EP S-BERT provides a 4-point increase in F1 score over the nearest baseline and a 34-point increase in F1 when compared to off-the-shelf medical NER tools trained to extract disease and medication mentions from clinical texts. All code and data are publicly available on Github.

READ FULL TEXT
research
10/23/2019

NER Models Using Pre-training and Transfer Learning for Healthcare

In this paper, we present our approach to extract structured information...
research
08/28/2023

ANER: Arabic and Arabizi Named Entity Recognition using Transformer-Based Approach

One of the main tasks of Natural Language Processing (NLP), is Named Ent...
research
09/22/2022

Scope of Pre-trained Language Models for Detecting Conflicting Health Information

An increasing number of people now rely on online platforms to meet thei...
research
04/07/2020

The Russian Drug Reaction Corpus and Neural Models for Drug Reactions and Effectiveness Detection in User Reviews

The Russian Drug Reaction Corpus (RuDReC) is a new partially annotated c...
research
05/26/2023

People and Places of Historical Europe: Bootstrapping Annotation Pipeline and a New Corpus of Named Entities in Late Medieval Texts

Although pre-trained named entity recognition (NER) models are highly ac...
research
04/05/2022

Multilinguals at SemEval-2022 Task 11: Transformer Based Architecture for Complex NER

We investigate the task of complex NER for the English language. The tas...
research
03/16/2023

The Scope of In-Context Learning for the Extraction of Medical Temporal Constraints

Medications often impose temporal constraints on everyday patient activi...

Please sign up or login with your details

Forgot password? Click here to reset