Lung Cancer Concept Annotation from Spanish Clinical Narratives

09/18/2018
by   Marjan Najafabadipour, et al.
0

Recent rapid increase in the generation of clinical data and rapid development of computational science make us able to extract new insights from massive datasets in healthcare industry. Oncological clinical notes are creating rich databases for documenting patients history and they potentially contain lots of patterns that could help in better management of the disease. However, these patterns are locked within free text (unstructured) portions of clinical documents and consequence in limiting health professionals to extract useful information from them and to finally perform Query and Answering (QA) process in an accurate way. The Information Extraction (IE) process requires Natural Language Processing (NLP) techniques to assign semantics to these patterns. Therefore, in this paper, we analyze the design of annotators for specific lung cancer concepts that can be integrated over Apache Unstructured Information Management Architecture (UIMA) framework. In addition, we explain the details of generation and storage of annotation outcomes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/15/2023

Question-Answering System Extracts Information on Injection Drug Use from Clinical Progress Notes

Injection drug use (IDU) is a dangerous health behavior that increases m...
research
12/15/2020

Enriched Annotations for Tumor Attribute Classification from Pathology Reports with Limited Labeled Data

Precision medicine has the potential to revolutionize healthcare, but mu...
research
08/14/2019

Two-stage Federated Phenotyping and Patient Representation Learning

A large percentage of medical information is in unstructured text format...
research
03/17/2015

Prediction Using Note Text: Synthetic Feature Creation with word2vec

word2vec affords a simple yet powerful approach of extracting quantitati...
research
11/21/2022

Unsupervised extraction, labelling and clustering of segments from clinical notes

This work is motivated by the scarcity of tools for accurate, unsupervis...
research
10/02/2020

Multi-domain Clinical Natural Language Processing with MedCAT: the Medical Concept Annotation Toolkit

Electronic health records (EHR) contain large volumes of unstructured te...
research
05/18/2022

A Scalable Workflow to Build Machine Learning Classifiers with Clinician-in-the-Loop to Identify Patients in Specific Diseases

Clinicians may rely on medical coding systems such as International Clas...

Please sign up or login with your details

Forgot password? Click here to reset