DrugEHRQA: A Question Answering Dataset on Structured and Unstructured Electronic Health Records For Medicine Related Queries

05/03/2022
by   Jayetri Bardhan, et al.
0

This paper develops the first question answering dataset (DrugEHRQA) containing question-answer pairs from both structured tables and unstructured notes from a publicly available Electronic Health Record (EHR). EHRs contain patient records, stored in structured tables and unstructured clinical notes. The information in structured and unstructured EHRs is not strictly disjoint: information may be duplicated, contradictory, or provide additional context between these sources. Our dataset has medication-related queries, containing over 70,000 question-answer pairs. To provide a baseline model and help analyze the dataset, we have used a simple model (MultimodalEHRQA) which uses the predictions of a modality selection network to choose between EHR tables and clinical notes to answer the questions. This is used to direct the questions to the table-based or text-based state-of-the-art QA model. In order to address the problem arising from complex, nested queries, this is the first time Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers (RAT-SQL) has been used to test the structure of query templates in EHR data. Our goal is to provide a benchmark dataset for multi-modal QA systems, and to open up new avenues of research in improving question answering over EHR structured data by using context from unstructured clinical data.

READ FULL TEXT

page 4

page 6

research
10/20/2020

Open Question Answering over Tables and Text

In open question answering (QA), the answer to a question is produced by...
research
05/15/2023

Question-Answering System Extracts Information on Injection Drug Use from Clinical Progress Notes

Injection drug use (IDU) is a dangerous health behavior that increases m...
research
08/05/2021

Dual Reader-Parser on Hybrid Textual and Tabular Evidence for Open Domain Question Answering

The current state-of-the-art generative models for open-domain question ...
research
03/14/2022

Uncertainty-Aware Text-to-Program for Question Answering on Structured Electronic Health Records

Question Answering on Electronic Health Records (EHR-QA) has a significa...
research
11/14/2021

Question Answering for Complex Electronic Health Records Database using Unified Encoder-Decoder Architecture

An intelligent machine that can answer human questions based on electron...
research
07/06/2022

BioTABQA: Instruction Learning for Biomedical Table Question Answering

Table Question Answering (TQA) is an important but under-explored task. ...
research
04/07/2022

Parameter-Efficient Abstractive Question Answering over Tables or Text

A long-term ambition of information seeking QA systems is to reason over...

Please sign up or login with your details

Forgot password? Click here to reset