Learning Contextualized Document Representations for Healthcare Answer Retrieval

02/03/2020
by   Sebastian Arnold, et al.
0

We present Contextual Discourse Vectors (CDV), a distributed document representation for efficient answer retrieval from long healthcare documents. Our approach is based on structured query tuples of entities and aspects from free text and medical taxonomies. Our model leverages a dual encoder architecture with hierarchical LSTM layers and multi-task training to encode the position of clinical entities and aspects alongside the document discourse. We use our continuous representations to resolve queries with short latency using approximate nearest neighbor search on sentence level. We apply the CDV model for retrieving coherent answer passages from nine English public health resources from the Web, addressing both patients and medical professionals. Because there is no end-to-end training data available for all application scenarios, we train our model with self-supervised data from Wikipedia. We show that our generalized model significantly outperforms several state-of-the-art baselines for healthcare passage ranking and is able to adapt to heterogeneous domains without additional fine-tuning.

READ FULL TEXT
research
08/02/2021

Self-supervised Answer Retrieval on Clinical Notes

Retrieving answer passages from long documents is a complex task requiri...
research
02/01/2022

Improving BERT-based Query-by-Document Retrieval with Multi-Task Optimization

Query-by-document (QBD) retrieval is an Information Retrieval task in wh...
research
12/21/2020

An End-to-End Document-Level Neural Discourse Parser Exploiting Multi-Granularity Representations

Document-level discourse parsing, in accordance with the Rhetorical Stru...
research
08/31/2021

Medical SANSformers: Training self-supervised transformers without attention for Electronic Medical Records

We leverage deep sequential models to tackle the problem of predicting h...
research
09/05/2022

Query-focused Extractive Summarisation for Biomedical and COVID-19 Complex Question Answering

This paper presents Macquarie University's participation to the two most...
research
10/22/2017

Bringing Semantic Structures to User Intent Detection in Online Medical Queries

The Internet has revolutionized healthcare by offering medical informati...
research
10/31/2018

Effective Feature Representation for Clinical Text Concept Extraction

Crucial information about the practice of healthcare is recorded only in...

Please sign up or login with your details

Forgot password? Click here to reset