Clinical Camel: An Open-Source Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding

05/19/2023
by   Augustin Toma, et al.
0

Large Language Models (LLMs) present immense potential in the medical field, yet concerns over data privacy, regulatory compliance, and model stability restrict their widespread adoption. Although the distillation of high-performing closed-source LLMs has proven effective for general tasks, their application in healthcare is limited due to reduced domain knowledge and remnants of alignment behavior hindering clinical tasks. To address these challenges, we propose Dialogue-Based Knowledge Encoding (DBKE). DBKE enhances models' implicit knowledge base and primes them for conversational recall, augmenting their conversational capabilities and enabling a soft alignment for subsequent use cases. By transforming dense academic source text into synthetic dialogue, DBKE broadens the model's knowledge base and enables a soft alignment that guides downstream behaviours. We present Clinical Camel, an open-source, healthcare-focused conversational model, to showcase the effectiveness of DBKE. Clinical Camel outperforms GPT-3.5 on the United States Medical Licensing Examination (USMLE) Step 1 and Step 3 with scores of 53.2 respectively, compared to GPT-3.5's scores of 36.1 adeptly handles multi-stage clinical case problems, provides adaptive counseling, and generates clinical notes. However, it is prone to hallucinations, which pose a significant obstacle in safety-critical settings. The performance of Clinical Camel underscores the importance of continued research and development of open-source models for the safe and effective integration of LLMs in healthcare settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/01/2023

Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes

The development of large language models tailored for handling patients'...
research
05/09/2023

Effective Medical Code Prediction via Label Internal Alignment

The clinical notes are usually typed into the system by physicians. They...
research
06/13/2023

XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models

The latest breakthroughs in large vision-language models, such as Bard a...
research
07/11/2023

SuryaKiran at MEDIQA-Sum 2023: Leveraging LoRA for Clinical Dialogue Summarization

Finetuning Large Language Models helps improve the results for domain-sp...
research
05/22/2023

Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting

Large language models (LLMs) demonstrate remarkable medical expertise, b...
research
08/19/2022

Graph-Augmented Cyclic Learning Framework for Similarity Estimation of Medical Clinical Notes

Semantic textual similarity (STS) in the clinical domain helps improve d...
research
06/28/2023

Beyond the Hype: Assessing the Performance, Trustworthiness, and Clinical Suitability of GPT3.5

The use of large language models (LLMs) in healthcare is gaining popular...

Please sign up or login with your details

Forgot password? Click here to reset