Comparative Analysis of Drug-GPT and ChatGPT LLMs for Healthcare Insights: Evaluating Accuracy and Relevance in Patient and HCP Contexts

by Giorgos Lysandrou, et al.

This study presents a comparative analysis of three Generative Pre-trained Transformer (GPT) solutions in a question-and-answer (Q&A) setting: Drug-GPT 3, Drug-GPT 4, and ChatGPT, in the context of healthcare applications. The objective is to determine which model delivers the most accurate and relevant information in response to prompts related to patient experiences with atopic dermatitis (AD) and healthcare professional (HCP) discussions about diabetes. The results demonstrate that while all three models are capable of generating relevant and accurate responses, Drug-GPT 3 and Drug-GPT 4, which are supported by curated datasets of patient and HCP social media and message board posts, provide more targeted and in-depth insights. ChatGPT, a general-purpose model, generates broader responses, which may be valuable for readers seeking a high-level understanding of the topics but may lack the depth and personal insights found in the answers generated by the specialized Drug-GPT models. This comparative analysis highlights the importance of considering the language model's perspective, depth of knowledge, and currency when evaluating the usefulness of generated information in healthcare applications.




Evaluation of GPT-3.5 and GPT-4 for supporting real-world information needs in healthcare delivery

Despite growing interest in using large language models (LLMs) in health...

Revealing Patient-Reported Experiences in Healthcare from Social Media using the DAPMAV Framework

Understanding patient experience in healthcare is increasingly important...

A Qualitative Evaluation of Language Models on Automatic Question-Answering for COVID-19

COVID-19 has resulted in an ongoing pandemic and as of 12 June 2020, has...

Conceptualising Healthcare-Seeking as an Activity to Explain Technology Use: A Case of M-health

The purpose of this paper is to engage with the Information Systems' con...

Performance of the Pre-Trained Large Language Model GPT-4 on Automated Short Answer Grading

Automated Short Answer Grading (ASAG) has been an active area of machine...

The Scope of In-Context Learning for the Extraction of Medical Temporal Constraints

Medications often impose temporal constraints on everyday patient activi...

Computational modeling of in-stent restenosis: Pharmacokinetic and pharmacodynamic evaluation

Persistence of the pathology of in-stent restenosis even with the advent...
