EHRKit: A Python Natural Language Processing Toolkit for Electronic Health Record Texts

04/13/2022
by Irene Li, et al.

The Electronic Health Record (EHR) is an essential part of the modern medical system and impacts healthcare delivery, operations, and research. Although EHRs also contain structured information, their unstructured text is attracting much attention and has become an active research field. The success of recent neural Natural Language Processing (NLP) methods has opened a new direction for processing unstructured clinical notes. In this work, we create a Python library for clinical texts, EHRKit. The library contains two main parts: MIMIC-III-specific functions and task-specific functions. The first part provides a set of interfaces for accessing MIMIC-III NOTEEVENTS data, including basic search, information retrieval, and information extraction. The second part integrates many third-party libraries to support up to 12 off-the-shelf NLP tasks, such as named entity recognition, summarization, and machine translation.
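To make the "basic search" idea concrete, here is a minimal sketch of keyword search over a NOTEEVENTS-like table. It is not EHRKit's API: the `Note` class, the `search_notes` and `notes_for_patient` helpers, and the sample notes are all hypothetical and synthetic, and the field names only loosely mirror NOTEEVENTS columns. It uses the Python standard library only.

```python
import re
from dataclasses import dataclass


@dataclass
class Note:
    """One row of a NOTEEVENTS-like table (hypothetical, synthetic data)."""
    row_id: int
    subject_id: int
    category: str
    text: str


# Made-up notes for illustration only; not real MIMIC-III content.
NOTES = [
    Note(1, 100, "Discharge summary", "Patient admitted with chest pain. Aspirin given."),
    Note(2, 100, "Nursing", "Patient resting comfortably. No pain reported."),
    Note(3, 200, "Discharge summary", "History of diabetes. Metformin continued."),
]


def search_notes(notes, keyword, category=None):
    """Return notes whose text contains `keyword` as a whole word.

    Matching is case-insensitive; `category` optionally restricts the
    results to one note category.
    """
    pattern = re.compile(r"\b" + re.escape(keyword) + r"\b", re.IGNORECASE)
    return [
        n for n in notes
        if pattern.search(n.text) and (category is None or n.category == category)
    ]


def notes_for_patient(notes, subject_id):
    """Return all notes belonging to one patient identifier."""
    return [n for n in notes if n.subject_id == subject_id]


if __name__ == "__main__":
    hits = search_notes(NOTES, "pain")
    print([n.row_id for n in hits])
```

Whole-word matching is a deliberate choice in this sketch: searching for "met" should not return a note that only mentions "Metformin". A real toolkit would likely query a database rather than an in-memory list.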

