The Potential and Pitfalls of using a Large Language Model such as ChatGPT or GPT-4 as a Clinical Assistant

07/16/2023
by   Jingqing Zhang, et al.
0

Recent studies have demonstrated promising performance of ChatGPT and GPT-4 on several medical domain tasks. However, none have assessed its performance using a large-scale real-world electronic health record database, nor have evaluated its utility in providing clinical diagnostic assistance for patients across a full range of disease presentation. We performed two analyses using ChatGPT and GPT-4, one to identify patients with specific medical diagnoses using a real-world large electronic health record database and the other, in providing diagnostic assistance to healthcare workers in the prospective evaluation of hypothetical patients. Our results show that GPT-4 across disease classification tasks with chain of thought and few-shot prompting can achieve performance as high as 96 accurately diagnose three out of four times. However, there were mentions of factually incorrect statements, overlooking crucial medical findings, recommendations for unnecessary investigations and overtreatment. These issues coupled with privacy concerns, make these models currently inadequate for real world clinical use. However, limited data and time needed for prompt engineering in comparison to configuration of conventional machine learning workflows highlight their potential for scalability across healthcare applications.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/23/2020

Clinical Recommender System: Predicting Medical Specialty Diagnostic Choices with Neural Network Ensembles

The growing demand for key healthcare resources such as clinical experti...
research
06/26/2023

HeaRT: Health Record Timeliner to visualise patients' medical history from health record text

Electronic health records (EHRs), which contain patients' medical histor...
research
10/17/2019

Generalized Mixed Modeling in Massive Electronic Health Record Databases: what is a healthy serum potassium?

Converting electronic health record (EHR) entries to useful clinical inf...
research
05/19/2021

Dark Patterns, Electronic Medical Records, and the Opioid Epidemic

Dark patterns have emerged as a set of methods to exploit cognitive bias...
research
11/06/2020

Trust Issues: Uncertainty Estimation Does Not Enable Reliable OOD Detection On Medical Tabular Data

When deploying machine learning models in high-stakes real-world environ...
research
05/04/2021

Supervised multi-specialist topic model with applications on large-scale electronic health record data

Motivation: Electronic health record (EHR) data provides a new venue to ...
research
04/21/2023

SkinGPT: A Dermatology Diagnostic System with Vision Large Language Model

Skin and subcutaneous diseases are among the major causes of the nonfata...

Please sign up or login with your details

Forgot password? Click here to reset