IvyGPT: InteractiVe Chinese pathwaY language model in medical domain

07/20/2023
by Rongsheng Wang, et al.

General large language models (LLMs) such as ChatGPT have shown remarkable success. However, such LLMs have not been widely adopted for medical purposes because of their poor accuracy and inability to provide medical advice. We propose IvyGPT, an LLM based on LLaMA that is trained and fine-tuned with high-quality medical question-answer (QA) instances and Reinforcement Learning from Human Feedback (RLHF). After supervised fine-tuning, IvyGPT has good multi-turn conversation capabilities, but it cannot yet perform like a doctor in other respects, such as comprehensive diagnosis. Through RLHF, IvyGPT can output richer diagnosis and treatment answers that are closer to those of a human. For training, we used QLoRA to train a 33-billion-parameter model on a small number of NVIDIA A100 (80GB) GPUs. Experimental results show that IvyGPT outperforms other medical GPT models.
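
As a rough illustration of why a 33-billion-parameter model can fit on a small number of 80GB GPUs, the sketch below compares the memory needed for the weights alone at 16-bit and at 4-bit precision. It assumes the usual QLoRA setup, in which the frozen base weights are stored in 4 bits; the abstract does not state the exact configuration, and the function name `weight_memory_gb` is hypothetical. Activations, adapter parameters, optimizer state, and quantization constants are ignored, so real usage would be higher.

```python
def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Memory for the weights alone, in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


NUM_PARAMS = 33e9  # parameter count taken from the abstract

# 16-bit weights, as in ordinary half-precision fine-tuning.
fp16_gb = weight_memory_gb(NUM_PARAMS, 16)

# 4-bit weights: an assumption based on the usual QLoRA setup.
four_bit_gb = weight_memory_gb(NUM_PARAMS, 4)

print(f"16-bit weights: {fp16_gb:.1f} GB")
print(f"4-bit weights:  {four_bit_gb:.1f} GB")
```

Under these assumptions the 4-bit weights take a quarter of the 16-bit footprint, which leaves room on a single 80GB card for the other training state that this estimate leaves out.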

research
04/14/2023

HuaTuo: Tuning LLaMA Model with Chinese Medical Knowledge

Large Language Models (LLMs), such as the LLaMA model, have demonstrated...
research
03/24/2023

ChatDoctor: A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge

Recent large language models (LLMs) in the general domain, such as ChatG...
research
08/07/2023

Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-world Multi-turn Dialogue

Recent advances in Large Language Models (LLMs) have achieved remarkable...
research
09/05/2023

An Automatic Evaluation Framework for Multi-turn Medical Consultations Capabilities of Large Language Models

Large language models (LLMs) have achieved significant success in intera...
research
04/03/2023

DoctorGLM: Fine-tuning your Chinese Doctor is not a Herculean Task

The recent progress of large language models (LLMs), including ChatGPT a...
research
08/28/2023

DISC-MedLLM: Bridging General Large Language Models and Real-World Medical Consultation

We propose DISC-MedLLM, a comprehensive solution that leverages Large La...
research
08/15/2023

LLM-Mini-CEX: Automatic Evaluation of Large Language Model for Diagnostic Conversation

There is an increasing interest in developing LLMs for medical diagnosis...
