NER Models Using Pre-training and Transfer Learning for Healthcare

10/23/2019
by   Amogh Kamat Tarcar, et al.
0

In this paper, we present our approach to extract structured information from unstructured Electronic Health Records (EHR) [2] to study adverse drug reactions on patients, due to chemicals in their products. Our solution uses a combination of Natural Language Processing (NLP) techniques and a web-based annotation tool to optimize the performance of a custom Named Entity Recognition (NER) [1] model trained on a limited amount of EHR training data. We showcase a combination of tools and techniques leveraging the recent advancements in NLP aimed at targeting domain shifts by applying transfer learning and language model pre-training techniques [3]. We present a comparison of our technique to the base models available and show the effective increase in performance of the NER model and the reduction in time to annotate data. A key observation of the results presented is that the F1 score of model (0.734) trained with our approach with just 50 outperforms the F1 score of the blank spaCy model (0.704) trained with 100 the available training data. We also demonstrate an annotation tool to minimize domain expert time and the manual effort required to generate such a training dataset. Further, we plan to release the annotated dataset as well as the pre-trained model to the community to further research in medical health records.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/03/2020

Med7: a transferable clinical natural language processing model for electronic health records

The field of clinical natural language processing has been advanced sign...
research
10/06/2022

HealthE: Classifying Entities in Online Textual Health Advice

The processing of entities in natural language is essential to many medi...
research
11/13/2018

Few-shot Learning for Named Entity Recognition in Medical Text

Deep neural network models have recently achieved state-of-the-art perfo...
research
01/06/2019

Named Entity Recognition in Electronic Health Records Using Transfer Learning Bootstrapped Neural Networks

Neural networks (NNs) have become the state of the art in many machine l...
research
10/28/2019

Attention-Gated Graph Convolution for Extracting Drugs and Their Interactions from Drug Labels

Preventable adverse events as a result of medical errors present a growi...
research
09/24/2021

GERNERMED – An Open German Medical NER Model

The current state of adoption of well-structured electronic health recor...
research
01/05/2022

Mining Adverse Drug Reactions from Unstructured Mediums at Scale

Adverse drug reactions / events (ADR/ADE) have a major impact on patient...

Please sign up or login with your details

Forgot password? Click here to reset