Word embeddings and recurrent neural networks based on Long-Short Term Memory nodes in supervised biomedical word sense disambiguation

04/09/2016
by   Antonio Jimeno Yepes, et al.
0

Word sense disambiguation helps identifying the proper sense of ambiguous words in text. With large terminologies such as the UMLS Metathesaurus ambiguities appear and highly effective disambiguation methods are required. Supervised learning algorithm methods are used as one of the approaches to perform disambiguation. Features extracted from the context of an ambiguous word are used to identify the proper sense of such a word. The type of features have an impact on machine learning methods, thus affect disambiguation performance. In this work, we have evaluated several types of features derived from the context of the ambiguous word and we have explored as well more global features derived from MEDLINE using word embeddings. Results show that word embeddings improve the performance of more traditional features and allow as well using recurrent neural network classifiers based on Long-Short Term Memory (LSTM) nodes. The combination of unigrams and word embeddings with an SVM sets a new state of the art performance with a macro accuracy of 95.97 in the MSH WSD data set.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/23/2017

Long Short-Term Memory for Japanese Word Segmentation

This study presents a Long Short-Term Memory (LSTM) neural network appro...
research
05/20/2021

TF-IDF vs Word Embeddings for Morbidity Identification in Clinical Notes: An Initial Study

Today, we are seeing an ever-increasing number of clinical notes that co...
research
02/03/2020

Twitter Bot Detection Using Bidirectional Long Short-term Memory Neural Networks and Word Embeddings

Twitter is a web application playing dual roles of online social network...
research
02/25/2018

One Single Deep Bidirectional LSTM Network for Word Sense Disambiguation of Text Data

Due to recent technical and scientific advances, we have a wealth of inf...
research
04/10/2019

Detecting Cybersecurity Events from Noisy Short Text

It is very critical to analyze messages shared over social networks for ...
research
01/03/2020

Question Type Classification Methods Comparison

The paper presents a comparative study of state-of-the-art approaches fo...

Please sign up or login with your details

Forgot password? Click here to reset