ScAN: Suicide Attempt and Ideation Events Dataset

05/12/2022
by Bhanu Pratap Singh Rawat, et al.

Suicide is an important public health concern and one of the leading causes of death worldwide. Suicidal behaviors, including suicide attempts (SA) and suicide ideation (SI), are leading risk factors for death by suicide. Information related to patients' previous and current SA and SI is frequently documented in electronic health record (EHR) notes. Accurate detection of such documentation may help improve surveillance and prediction of patients' suicidal behaviors and alert medical professionals for suicide prevention efforts. In this study, we first built the Suicide Attempt and Ideation Events (ScAN) dataset, a subset of the publicly available MIMIC III dataset spanning more than 12k EHR notes with more than 19k annotated SA and SI events. The annotations also contain attributes such as the method of suicide attempt. We also provide a strong baseline model, ScANER (Suicide Attempt and Ideation Events Retriever), a multi-task RoBERTa-based model with a retrieval module to extract all the relevant evidence of suicidal behavior from the EHR notes of a hospital stay, and a prediction module to identify the type of suicidal behavior (SA and SI) concluded during the patient's stay at the hospital. ScANER achieved a macro-weighted F1-score of 0.83 for identifying evidence of suicidal behavior, and macro F1-scores of 0.78 and 0.60 for classification of SA and SI, respectively, for the patient's hospital stay. ScAN and ScANER are publicly available.
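For readers unfamiliar with the metric reported above, the following is a minimal sketch of how a macro F1-score can be computed: the F1-score is calculated per class and then averaged with equal weight for each class. The function names and the toy labels are illustrative assumptions, not taken from the ScANER code or the ScAN annotation scheme.

```python
from collections import Counter


def per_class_f1(y_true, y_pred):
    """Return a dict mapping each label to its F1-score."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted label was wrong
            fn[truth] += 1  # true label was missed
    scores = {}
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        # F1 = 2TP / (2TP + FP + FN); defined as 0.0 when the denominator is 0
        scores[label] = 2 * tp[label] / denom if denom else 0.0
    return scores


def macro_f1(y_true, y_pred):
    """Unweighted mean of the per-class F1-scores."""
    scores = per_class_f1(y_true, y_pred)
    return sum(scores.values()) / len(scores)


# Hypothetical toy labels, for illustration only.
y_true = ["SA", "SA", "SI", "none"]
y_pred = ["SA", "SI", "SI", "none"]
print(round(macro_f1(y_true, y_pred), 4))
```

Because every class contributes equally to the average, a macro F1-score is sensitive to performance on rare classes, which matters when some categories of suicidal behavior are documented far less often than others.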


