Negation detection in Dutch clinical texts: an evaluation of rule-based and machine learning methods

09/01/2022
by   Bram van Es, et al.
0

As structured data are often insufficient, labels need to be extracted from free text in electronic health records when developing models for clinical information retrieval and decision support systems. One of the most important contextual properties in clinical text is negation, which indicates the absence of findings. We aimed to improve large scale extraction of labels by comparing three methods for negation detection in Dutch clinical notes. We used the Erasmus Medical Center Dutch Clinical Corpus to compare a rule-based method based on ContextD, a biLSTM model using MedCAT and (finetuned) RoBERTa-based models. We found that both the biLSTM and RoBERTa models consistently outperform the rule-based model in terms of F1 score, precision and recall. In addition, we systematically categorized the classification errors for each model, which can be used to further improve model performance in particular applications. Combining the three models naively was not beneficial in terms of performance. We conclude that the biLSTM and RoBERTa-based models in particular are highly accurate accurate in detecting clinical negations, but that ultimately all three approaches can be viable depending on the use case at hand.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/15/2019

Natural Language Processing of Clinical Notes on Chronic Diseases: Systematic Review

Of the 2652 articles considered, 106 met the inclusion criteria. Review ...
research
12/16/2017

NegBio: a high-performance tool for negation and uncertainty detection in radiology reports

Negative and uncertain medical findings are frequent in radiology report...
research
08/16/2023

Large Language Models for Granularized Barrett's Esophagus Diagnosis Classification

Diagnostic codes for Barrett's esophagus (BE), a precursor to esophageal...
research
07/11/2015

A new hybrid stemming algorithm for Persian

Stemming has been an influential part in Information retrieval and searc...
research
04/29/2019

Improving Mechanical Ventilator Clinical Decision Support Systems with A Machine Learning Classifier for Determining Ventilator Mode

Clinical decision support systems (CDSS) will play an in-creasing role i...
research
02/07/2023

Undersampling and Cumulative Class Re-decision Methods to Improve Detection of Agitation in People with Dementia

Agitation is one of the most prevalent symptoms in people with dementia ...
research
09/06/2017

"Having 2 hours to write a paper is fun!": Detecting Sarcasm in Numerical Portions of Text

Sarcasm occurring due to the presence of numerical portions in text has ...

Please sign up or login with your details

Forgot password? Click here to reset