Distantly supervised end-to-end medical entity extraction from electronic health records with human-level quality

01/25/2022
by   Alexander Nesterov, et al.
0

Medical entity extraction (EE) is a standard procedure used as a first stage in medical texts processing. Usually Medical EE is a two-step process: named entity recognition (NER) and named entity normalization (NEN). We propose a novel method of doing medical EE from electronic health records (EHR) as a single-step multi-label classification task by fine-tuning a transformer model pretrained on a large EHR dataset. Our model is trained end-to-end in an distantly supervised manner using targets automatically extracted from medical knowledge base. We show that our model learns to generalize for entities that are present frequently enough, achieving human-level classification quality for most frequent entities. Our work demonstrates that medical entity extraction can be done end-to-end without human supervision and with human quality given the availability of a large enough amount of unlabeled EHR and a medical knowledge base.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/09/2019

Knowledge-Augmented Language Model and its Application to Unsupervised Named-Entity Recognition

Traditional language models are unable to efficiently model entity names...
research
04/30/2020

Unlocking the Power of Deep PICO Extraction: Step-wise Medical NER Identification

The PICO framework (Population, Intervention, Comparison, and Outcome) i...
research
08/23/2023

Knowledge-injected Prompt Learning for Chinese Biomedical Entity Normalization

The Biomedical Entity Normalization (BEN) task aims to align raw, unstru...
research
04/27/2020

Knowledge Base Completion for Constructing Problem-Oriented Medical Records

Both electronic health records and personal health records are typically...
research
10/15/2019

Comprehend Medical: a Named Entity Recognition and Relationship Extraction Web Service

Comprehend Medical is a stateless and Health Insurance Portability and A...
research
07/12/2022

OSLAT: Open Set Label Attention Transformer for Medical Entity Span Extraction

Identifying spans in medical texts that correspond to medical entities i...
research
03/23/2020

E2EET: From Pipeline to End-to-end Entity Typing via Transformer-Based Embeddings

Entity Typing (ET) is the process of identifying the semantic types of e...

Please sign up or login with your details

Forgot password? Click here to reset