Self-supervised Answer Retrieval on Clinical Notes

08/02/2021
by   Paul Grundmann, et al.
0

Retrieving answer passages from long documents is a complex task requiring semantic understanding of both discourse and document context. We approach this challenge specifically in a clinical scenario, where doctors retrieve cohorts of patients based on diagnoses and other latent medical aspects. We introduce CAPR, a rule-based self-supervision objective for training Transformer language models for domain-specific passage matching. In addition, we contribute a novel retrieval dataset based on clinical notes to simulate this scenario on a large corpus of clinical notes. We apply our objective in four Transformer-based architectures: Contextual Document Vectors, Bi-, Poly- and Cross-encoders. From our extensive evaluation on MIMIC-III and three other healthcare datasets, we report that CAPR outperforms strong baselines in the retrieval of domain-specific passages and effectively generalizes across rule-based and human-labeled passages. This makes the model powerful especially in zero-shot scenarios where only limited training data is available.

READ FULL TEXT
research
02/03/2020

Learning Contextualized Document Representations for Healthcare Answer Retrieval

We present Contextual Discourse Vectors (CDV), a distributed document re...
research
07/13/2023

Making the Most Out of the Limited Context Length: Predictive Power Varies with Clinical Note Type and Note Section

Recent advances in large language models have led to renewed interest in...
research
04/17/2021

Hierarchical Transformer Networks for Longitudinal Clinical Document Classification

We present the Hierarchical Transformer Networks for modeling long-term ...
research
08/17/2022

Summarizing Patients Problems from Hospital Progress Notes Using Pre-trained Sequence-to-Sequence Models

Automatically summarizing patients' main problems from daily progress no...
research
10/12/2022

Developing a general-purpose clinical language inference model from a large corpus of clinical notes

Several biomedical language models have already been developed for clini...
research
11/13/2018

Embedding Electronic Health Records for Clinical Information Retrieval

Neural network representation learning frameworks have recently shown to...
research
01/09/2023

Leveraging Contextual Relatedness to Identify Suicide Documentation in Clinical Notes through Zero Shot Learning

Identifying suicidality including suicidal ideation, attempts, and risk ...

Please sign up or login with your details

Forgot password? Click here to reset