Appraising the Potential Uses and Harms of LLMs for Medical Systematic Reviews

05/19/2023
by Hye Sun Yun, et al.

Medical systematic reviews are crucial for informing clinical decision making and healthcare policy. But producing such reviews is onerous and time-consuming. Thus, high-quality evidence synopses are not available for many questions and may be outdated even when they are available. Large language models (LLMs) are now capable of generating long-form texts, suggesting the tantalizing possibility of automatically generating literature reviews on demand. However, LLMs sometimes generate inaccurate (and potentially misleading) texts by hallucinating or omitting important information. In the healthcare context, this may render LLMs unusable at best and dangerous at worst. Most discussion surrounding the benefits and risks of LLMs has been divorced from specific applications. In this work, we seek to qualitatively characterize the potential utility and risks of LLMs for assisting in the production of medical evidence reviews. We conducted 16 semi-structured interviews with international experts in systematic reviews, grounding discussion in the context of generating evidence reviews. Domain experts indicated that LLMs could aid in writing reviews, as a tool for drafting or creating plain language summaries, generating templates or suggestions, distilling information, crosschecking, and synthesizing or interpreting text inputs. But they also identified issues with model outputs and expressed concerns about potential downstream harms of confidently composed but inaccurate LLM outputs that might mislead. Other anticipated downstream harms included lessened accountability and a proliferation of automatically generated reviews that might be of low quality. Informed by this qualitative analysis, we identify criteria for rigorous evaluation of biomedical LLMs aligned with domain expert views.

research
12/17/2021

Search Strategy Formulation for Systematic Reviews: issues, challenges and opportunities

Systematic literature reviews play a vital role in identifying the best ...
research
02/03/2023

Can ChatGPT Write a Good Boolean Query for Systematic Review Literature Search?

Systematic reviews are comprehensive reviews of the literature for a hig...
research
09/19/2022

Automated MeSH Term Suggestion for Effective Query Formulation in Systematic Reviews Literature Search

High-quality medical systematic reviews require comprehensive literature...
research
08/25/2020

Generating (Factual?) Narrative Summaries of RCTs: Experiments with Neural Multi-Document Summarization

We consider the problem of automatically generating a narrative biomedic...
research
05/10/2023

Summarizing, Simplifying, and Synthesizing Medical Evidence Using GPT-3 (with Varying Success)

Large language models, particularly GPT-3, are able to produce high qual...
research
08/22/2019

Viability of machine learning to reduce workload in systematic review screenings in the health sciences: a working paper

Systematic reviews, which summarize and synthesize all the current resea...
research
12/01/2021

MeSH Term Suggestion for Systematic Review Literature Search

High-quality medical systematic reviews require comprehensive literature...
