Natural language processing to automatically extract the presence and severity of esophagitis in notes of patients undergoing radiotherapy

03/24/2023
by Shan Chen, et al.

Radiotherapy (RT) toxicities can impair survival and quality of life, yet remain under-studied. Real-world evidence holds potential to improve our understanding of toxicities, but toxicity information is often recorded only in clinical notes. We developed natural language processing (NLP) models to identify the presence and severity of esophagitis from notes of patients treated with thoracic RT. We fine-tuned statistical and pre-trained BERT-based models for three esophagitis classification tasks: Task 1) presence of esophagitis, Task 2) severe esophagitis or not, and Task 3) no esophagitis vs. grade 1 vs. grade 2-3. Transferability was tested on 345 notes from patients with esophageal cancer undergoing RT. Fine-tuning PubmedBERT yielded the best performance. The best macro-F1 was 0.92, 0.82, and 0.74 for Task 1, 2, and 3, respectively. Selecting the most informative note sections during fine-tuning improved macro-F1 by over 2% for all tasks. Silver-labeled data improved the macro-F1 by over 3% across all tasks. For the esophageal cancer notes, the best macro-F1 was 0.73, 0.74, and 0.65 for Task 1, 2, and 3, respectively, without additional fine-tuning. To our knowledge, this is the first effort to automatically extract esophagitis toxicity severity according to CTCAE guidelines from clinic notes. The promising performance provides proof-of-concept for NLP-based automated detailed toxicity monitoring in expanded domains.
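The three task definitions and the macro-F1 metric reported above can be illustrated with a minimal sketch. This is not the authors' code: the function names, the integer grade encoding (0 to 3), and the toy labels are hypothetical, and the Task 2 cutoff (`severe_from=2`) is an assumption, since the abstract does not state which grades count as severe.

```python
def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all classes seen in either list."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


def to_task1(grade):
    """Task 1: presence of esophagitis (any grade >= 1)."""
    return int(grade >= 1)


def to_task2(grade, severe_from=2):
    """Task 2: severe or not. The cutoff is an ASSUMPTION, not from the abstract."""
    return int(grade >= severe_from)


def to_task3(grade):
    """Task 3: no esophagitis (0) vs. grade 1 (1) vs. grade 2-3 (2)."""
    return min(grade, 2)


# Toy, made-up grades purely for illustration (not study data).
true_grades = [0, 0, 1, 2, 3, 1]
pred_grades = [0, 1, 1, 2, 2, 0]

task1_score = macro_f1([to_task1(g) for g in true_grades],
                       [to_task1(g) for g in pred_grades])
task3_score = macro_f1([to_task3(g) for g in true_grades],
                       [to_task3(g) for g in pred_grades])
print(round(task1_score, 3), round(task3_score, 3))
```

Because macro-F1 averages per-class scores without weighting by class frequency, a rare class such as severe esophagitis counts as much as the common "no esophagitis" class.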


