DR.BENCH: Diagnostic Reasoning Benchmark for Clinical Natural Language Processing

09/29/2022
by   Yanjun Gao, et al.
19

The meaningful use of electronic health records (EHR) continues to progress in the digital era with clinical decision support systems augmented by artificial intelligence. A priority in improving provider experience is to overcome information overload and reduce the cognitive burden so fewer medical errors and cognitive biases are introduced during patient care. One major type of medical error is diagnostic error due to systematic or predictable errors in judgment that rely on heuristics. The potential for clinical natural language processing (cNLP) to model diagnostic reasoning in humans with forward reasoning from data to diagnosis and potentially reduce the cognitive burden and medical error has not been investigated. Existing tasks to advance the science in cNLP have largely focused on information extraction and named entity recognition through classification tasks. We introduce a novel suite of tasks coined as Diagnostic Reasoning Benchmarks, DR.BENCH, as a new benchmark for developing and evaluating cNLP models with clinical diagnostic reasoning ability. The suite includes six tasks from ten publicly available datasets addressing clinical text understanding, medical knowledge reasoning, and diagnosis generation. DR.BENCH is the first clinical suite of tasks designed to be a natural language generation framework to evaluate pre-trained language models. Experiments with state-of-the-art pre-trained generative language models using large general domain models and models that were continually trained on a medical corpus demonstrate opportunities for improvement when evaluated in DR. BENCH. We share DR. BENCH as a publicly available GitLab repository with a systematic approach to load and evaluate models for the cNLP community.

READ FULL TEXT
research
07/18/2023

Large Language Models Perform Diagnostic Reasoning

We explore the extension of chain-of-thought (CoT) prompting to medical ...
research
08/28/2023

Leveraging A Medical Knowledge Graph into Large Language Models for Diagnosis Prediction

Electronic Health Records (EHRs) and routine documentation practices pla...
research
06/07/2023

Multi-Task Training with In-Domain Language Models for Diagnostic Reasoning

Generative artificial intelligence (AI) is a promising direction for aug...
research
08/13/2023

Diagnostic Reasoning Prompts Reveal the Potential for Large Language Model Interpretability in Medicine

One of the major barriers to using large language models (LLMs) in medic...
research
04/06/2022

Hierarchical Annotation for Building A Suite of Clinical Natural Language Processing Tasks: Progress Note Understanding

Applying methods in natural language processing on electronic health rec...
research
07/08/2022

A Medical Information Extraction Workbench to Process German Clinical Text

Background: In the information extraction and natural language processin...
research
03/14/2023

Progress Note Understanding – Assessment and Plan Reasoning: Overview of the 2022 N2C2 Track 3 Shared Task

Daily progress notes are common types in the electronic health record (E...

Please sign up or login with your details

Forgot password? Click here to reset