Effective Feature Representation for Clinical Text Concept Extraction

10/31/2018
by   Yifeng Tao, et al.
0

Crucial information about the practice of healthcare is recorded only in free-form text, which creates an enormous opportunity for high-impact NLP. However, annotated healthcare datasets tend to be small and expensive to obtain, which raises the question of how to make maximally efficient uses of the available data. To this end, we develop an LSTM-CRF model for combining unsupervised word representations and hand-built feature representations derived from publicly available healthcare ontologies. We show that this combined model yields superior performance on five datasets of diverse kinds of healthcare text (clinical, social, scientific, commercial). Each involves the labeling of complex, multi-word spans that pick out different healthcare concepts. We also introduce a new labeled dataset for identifying the treatment relations between drugs and diseases.

READ FULL TEXT
research
11/23/2020

An Empirical Study of Representation Learning for Reinforcement Learning in Healthcare

Reinforcement Learning (RL) has recently been applied to sequential esti...
research
01/10/2023

Language Models sounds the Death Knell of Knowledge Graphs

Healthcare domain generates a lot of unstructured and semi-structured te...
research
12/22/2022

Word Embedding Neural Networks to Advance Knee Osteoarthritis Research

Osteoarthritis (OA) is the most prevalent chronic joint disease worldwid...
research
08/24/2019

Representation Learning with Autoencoders for Electronic Health Records: A Comparative Study

Increasing volume of Electronic Health Records (EHR) in recent years pro...
research
03/20/2023

Leveraging Foundation Models for Clinical Text Analysis

Infectious diseases are a significant public health concern globally, an...
research
05/26/2022

Clinical Dialogue Transcription Error Correction using Seq2Seq Models

Good communication is critical to good healthcare. Clinical dialogue is ...
research
02/03/2020

Learning Contextualized Document Representations for Healthcare Answer Retrieval

We present Contextual Discourse Vectors (CDV), a distributed document re...

Please sign up or login with your details

Forgot password? Click here to reset