Annotating Social Determinants of Health Using Active Learning, and Characterizing Determinants Using Neural Event Extraction

by   Kevin Lybarger, et al.
University of Washington

Social determinants of health (SDOH) affect health outcomes, and knowledge of SDOH can inform clinical decision-making. Automatically extracting SDOH information from clinical text requires data-driven information extraction models trained on annotated corpora that are heterogeneous and frequently include critical SDOH. This work presents a new corpus with SDOH annotations, a novel active learning framework, and the first extraction results on the new corpus. The Social History Annotation Corpus (SHAC) includes 4,480 social history sections with detailed annotation for 12 SDOH characterizing the status, extent, and temporal information of 18K distinct events. We introduce a novel active learning framework that selects samples for annotation using a surrogate text classification task as a proxy for a more complex event extraction task. The active learning framework successfully increases the frequency of health risk factors and improves automatic detection of these events over undirected annotation. An event extraction model trained on SHAC achieves high extraction performance for substance use status (0.82-0.93 F1), employment status (0.81-0.86 F1), and living status type (0.81-0.93 F1) on data from three institutions.


page 1

page 2

page 3

page 4


Prompt-based Extraction of Social Determinants of Health Using Few-shot Learning

Social determinants of health (SDOH) documented in the electronic health...

The 2022 n2c2/UW Shared Task on Extracting Social Determinants of Health

Objective: The n2c2/UW SDOH Challenge explores the extraction of social ...

Events Beyond ACE: Curated Training for Events

We explore a human-driven approach to annotation, curated training (CT),...

Whats New? Identifying the Unfolding of New Events in Narratives

Narratives include a rich source of events unfolding over time and conte...

Crude Oil-related Events Extraction and Processing: A Transfer Learning Approach

One of the challenges in event extraction via traditional supervised lea...

The Benefits of Word Embeddings Features for Active Learning in Clinical Information Extraction

This study investigates the use of unsupervised word embeddings and sequ...

Please sign up or login with your details

Forgot password? Click here to reset