CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes

06/04/2021
by   James Mullenbach, et al.
0

Continuity of care is crucial to ensuring positive health outcomes for patients discharged from an inpatient hospital setting, and improved information sharing can help. To share information, caregivers write discharge notes containing action items to share with patients and their future caregivers, but these action items are easily lost due to the lengthiness of the documents. In this work, we describe our creation of a dataset of clinical action items annotated over MIMIC-III, the largest publicly available dataset of real clinical notes. This dataset, which we call CLIP, is annotated by physicians and covers 718 documents representing 100K sentences. We describe the task of extracting the action items from these documents as multi-aspect extractive summarization, with each aspect representing a type of action to be taken. We evaluate several machine learning models on this task, and show that the best models exploit in-domain language model pre-training on 59K unannotated documents, and incorporate context from neighboring sentences. We also propose an approach to pre-training data selection that allows us to explore the trade-off between size and domain-specificity of pre-training datasets for this task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/17/2022

Summarizing Patients Problems from Hospital Progress Notes Using Pre-trained Sequence-to-Sequence Models

Automatically summarizing patients' main problems from daily progress no...
research
02/24/2023

Modelling Temporal Document Sequences for Clinical ICD Coding

Past studies on the ICD coding problem focus on predicting clinical code...
research
04/10/2019

ClinicalBERT: Modeling Clinical Notes and Predicting Hospital Readmission

Clinical notes contain information about patients that goes beyond struc...
research
03/07/2023

A Meta-Evaluation of Faithfulness Metrics for Long-Form Hospital-Course Summarization

Long-form clinical summarization of hospital admissions has real-world s...
research
05/12/2022

ScAN: Suicide Attempt and Ideation Events Dataset

Suicide is an important public health concern and one of the leading cau...
research
10/12/2020

Extracting Angina Symptoms from Clinical Notes Using Pre-Trained Transformer Architectures

Anginal symptoms can connote increased cardiac risk and a need for chang...
research
06/05/2023

PULSAR: Pre-training with Extracted Healthcare Terms for Summarising Patients' Problems and Data Augmentation with Black-box Large Language Models

Medical progress notes play a crucial role in documenting a patient's ho...

Please sign up or login with your details

Forgot password? Click here to reset