A New Data Representation Based on Training Data Characteristics to Extract Drug Named-Entity in Medical Text

10/06/2016
by Sadikin Mujiono, et al.

One essential task in information extraction from medical corpora is drug name recognition. Compared with text from other domains, medical text has unique characteristics. Mining it also poses more challenges, e.g., less structured text, the rapid addition of new terms, and a wide range of name variations for the same drug. The task is made harder still by the lack of labeled datasets and external knowledge, and by drug names that span multiple tokens, which are more common in real application settings. Although many approaches have been proposed to tackle the task, some problems remain, with poor F-score performance (less than 0.75). This paper presents a new treatment of data representation to overcome some of those challenges. We propose three data representation techniques based on the characteristics of word distribution and on word similarities obtained from word embedding training. The first technique is evaluated with a standard NN model, i.e., MLP (Multi-Layer Perceptron). The second technique involves two deep network classifiers, i.e., DBN (Deep Belief Networks) and SAE (Stacked Denoising Encoders). The third technique represents the sentence as a sequence and is evaluated with a recurrent NN model, i.e., LSTM (Long Short-Term Memory). In extracting drug name entities, the third technique gives the best F-score performance compared to the state of the art, with an average F-score of 0.8645.
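The abstract reports results as F-scores but does not say how they are computed. As a minimal sketch, assuming the standard F1 definition (the harmonic mean of precision and recall over extracted drug-name entities), the calculation could look like the following. The function name `f_score` and the counts in the example are hypothetical and are not taken from the paper.

```python
def f_score(true_positives: int, false_positives: int, false_negatives: int) -> float:
    """Standard F1: harmonic mean of precision and recall.

    Assumption: this is the usual definition; the abstract does not
    state which F-score variant the authors used.
    """
    if true_positives == 0:
        # No correct extractions: precision and recall are both zero.
        return 0.0
    precision = true_positives / (true_positives + false_positives)
    recall = true_positives / (true_positives + false_negatives)
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts for illustration only (not results from the paper).
example = f_score(true_positives=80, false_positives=10, false_negatives=20)
print(round(example, 4))
```

Because F1 penalizes both missed drug names (false negatives) and spurious ones (false positives), a system must do well on both to move from below 0.75 to the level the abstract reports.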


