Distilling Large Language Models for Biomedical Knowledge Extraction: A Case Study on Adverse Drug Events

07/12/2023
by   Yu Gu, et al.
0

Large language models (LLMs), such as GPT-4, have demonstrated remarkable capabilities across a wide range of tasks, including health applications. In this paper, we study how LLMs can be used to scale biomedical knowledge curation. We find that while LLMs already possess decent competency in structuring biomedical text, by distillation into a task-specific student model through self-supervised learning, substantial gains can be attained over out-of-box LLMs, with additional advantages such as cost, efficiency, and white-box model access. We conduct a case study on adverse drug event (ADE) extraction, which is an important area for improving care. On standard ADE extraction evaluation, a GPT-3.5 distilled PubMedBERT model attained comparable accuracy as supervised state-of-the-art models without using any labeled data. Despite being over 1,000 times smaller, the distilled model outperformed its teacher GPT-3.5 by over 6 absolute points in F1 and GPT-4 by over 5 absolute points. Ablation studies on distillation model choice (e.g., PubMedBERT vs BioGPT) and ADE extraction architecture shed light on best practice for biomedical knowledge extraction. Similar gains were attained by distillation for other standard biomedical knowledge extraction tasks such as gene-disease associations and protected health information, further illustrating the promise of this approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/07/2023

UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition

Large language models (LLMs) have demonstrated remarkable generalizabili...
research
05/22/2023

BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance

Timely and accurate extraction of Adverse Drug Events (ADE) from biomedi...
research
10/22/2022

PHEE: A Dataset for Pharmacovigilance Event Extraction from Text

The primary goal of drug safety researchers and regulators is to promptl...
research
01/02/2018

An Attentive Sequence Model for Adverse Drug Event Extraction from Biomedical Text

Adverse reaction caused by drugs is a potentially dangerous problem whic...
research
06/28/2023

LLM Calibration and Automatic Hallucination Detection via Pareto Optimal Self-supervision

Large language models (LLMs) have demonstrated remarkable capabilities o...
research
11/01/2020

MixKD: Towards Efficient Distillation of Large-scale Language Models

Large-scale language models have recently demonstrated impressive empiri...
research
05/24/2023

Is GPT-4 a Good Data Analyst?

As large language models (LLMs) have demonstrated their powerful capabil...

Please sign up or login with your details

Forgot password? Click here to reset