FastContext: an efficient and scalable implementation of the ConText algorithm

04/30/2019
by   Jianlin Shi, et al.
0

Objective: To develop and evaluate FastContext, an efficient, scalable implementation of the ConText algorithm suitable for very large-scale clinical natural language processing. Background: The ConText algorithm performs with state-of-art accuracy in detecting the experiencer, negation status, and temporality of concept mentions in clinical narratives. However, the speed limitation of its current implementations hinders its use in big data processing. Methods: We developed FastContext through hashing the ConText's rules, then compared its speed and accuracy with JavaConText and GeneralConText, two widely used Java implementations. Results: FastContext ran two orders of magnitude faster and was less decelerated by rule increase than the other two implementations used in this study for comparison. Additionally, FastContext consistently gained accuracy improvement as the rules increased (the desired outcome of adding new rules), while the other two implementations did not. Conclusions: FastContext is an efficient, scalable implementation of the popular ConText algorithm, suitable for natural language applications on very large clinical corpora.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/26/2018

Big Data Analytic based on Scalable PANFIS for RFID Localization

RFID technology has gained popularity to address localization problem in...
research
06/01/2021

Validating GAN-BioBERT: A Methodology For Assessing Reporting Trends In Clinical Trials

In the past decade, there has been much discussion about the issue of bi...
research
03/02/2022

A New Framework for Expressing, Parallelizing and Optimizing Big Data Applications

The Forelem framework was first introduced as a means to optimize databa...
research
08/31/2018

Rx-Caffe: Framework for evaluating and training Deep Neural Networks on Resistive Crossbars

Deep Neural Networks (DNNs) are widely used to perform machine learning ...
research
02/02/2012

Resolving Implementation Ambiguity and Improving SURF

Speeded Up Robust Features (SURF) has emerged as one of the more popular...
research
10/13/2021

FlexiTerm: A more efficient implementation of flexible multi-word term recognition

Terms are linguistic signifiers of domain-specific concepts. Automated r...

Please sign up or login with your details

Forgot password? Click here to reset