Dr. LLaMA: Improving Small Language Models in Domain-Specific QA via Generative Data Augmentation

05/12/2023
by Zhen Guo, et al.

Large Language Models (LLMs) have made significant strides in natural language processing but face challenges in terms of computational expense and inefficiency as they grow in size, especially in domain-specific tasks. Small Language Models (SLMs), on the other hand, often struggle in these tasks due to limited capacity and training data. In this paper, we introduce Dr. LLaMA, a method for improving SLMs through generative data augmentation using LLMs, focusing on medical question-answering tasks and the PubMedQA dataset. Our findings indicate that LLMs effectively refine and diversify existing question-answer pairs, resulting in improved performance of a much smaller model on domain-specific QA datasets after fine-tuning. This study highlights the challenges of using LLMs for domain-specific question answering and suggests potential research directions to address these limitations, ultimately aiming to create more efficient and capable models for specialized applications. We have also made our code available for interested researchers.
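As a rough illustration of the generative data augmentation idea described in the abstract, the sketch below expands a seed set of question-answer pairs with rewritten variants before fine-tuning. It is not the authors' implementation: the names `augment_qa_pairs` and `toy_rewrite`, the `n_variants` parameter, and the seed example are all hypothetical, and `toy_rewrite` is a deterministic placeholder standing in for a call to an LLM.

```python
from typing import Callable, Dict, List

QAPair = Dict[str, str]


def augment_qa_pairs(
    pairs: List[QAPair],
    rewrite: Callable[[QAPair, int], QAPair],
    n_variants: int = 2,
) -> List[QAPair]:
    """Return the original pairs plus up to n_variants rewrites of each.

    `rewrite` is assumed to wrap an LLM call that refines or paraphrases
    a pair; exact duplicates (case-insensitive) are dropped.
    """
    augmented: List[QAPair] = []
    seen = set()
    for pair in pairs:
        # Keep the original pair first, then its generated variants.
        candidates = [pair] + [rewrite(pair, i) for i in range(n_variants)]
        for cand in candidates:
            key = (cand["question"].strip().lower(), cand["answer"].strip().lower())
            if key not in seen:
                seen.add(key)
                augmented.append(cand)
    return augmented


def toy_rewrite(pair: QAPair, variant: int) -> QAPair:
    """Hypothetical placeholder for an LLM rewrite; only rephrases the question."""
    prefixes = ["In short, ", "Put differently, "]
    question = pair["question"]
    new_question = prefixes[variant % len(prefixes)] + question[0].lower() + question[1:]
    return {"question": new_question, "answer": pair["answer"]}


# Toy seed data for illustration only; not taken from PubMedQA.
seed = [{"question": "Does drug X reduce symptom Y?", "answer": "yes"}]
train_set = augment_qa_pairs(seed, toy_rewrite, n_variants=2)
```

In a real pipeline, `train_set` would then be used to fine-tune the smaller model, and the rewrite step would need some check that generated variants preserve the original answer.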

research
08/07/2023

KITLM: Domain-Specific Knowledge InTegration into Language Models for Question Answering

Large language models (LLMs) have demonstrated remarkable performance in...
research
09/02/2023

LeanContext: Cost-Efficient Domain-Specific Question Answering Using LLMs

Question-answering (QA) is a significant application of Large Language M...
research
08/03/2023

Domain specificity and data efficiency in typo tolerant spell checkers: the case of search in online marketplaces

Typographical errors are a major source of frustration for visitors of o...
research
05/12/2023

When Giant Language Brains Just Aren't Enough! Domain Pizzazz with Knowledge Sparkle Dust

Large language models (LLMs) have significantly advanced the field of na...
research
01/10/2023

There is No Big Brother or Small Brother: Knowledge Infusion in Language Models for Link Prediction and Question Answering

The integration of knowledge graphs with deep learning is thriving in im...
research
09/05/2023

Augmenting Black-box LLMs with Medical Textbooks for Clinical Question Answering

Large-scale language models (LLMs), such as ChatGPT, are capable of gene...
research
09/14/2023

CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration

In recent years, large language models (LLMs) have shown remarkable capa...
