Bio-SIEVE: Exploring Instruction Tuning Large Language Models for Systematic Review Automation

08/12/2023
by Ambrose Robinson, et al.

Medical systematic reviews can be costly and resource-intensive. We explore how Large Language Models (LLMs) can support, and be trained to perform, literature screening when provided with a detailed set of selection criteria. Specifically, we instruction-tune LLaMA and Guanaco models to perform abstract screening for medical systematic reviews. Our best model, Bio-SIEVE, outperforms both ChatGPT and trained traditional approaches, and generalises better across medical domains. However, adapting the model to safety-first scenarios remains a challenge. We also explore the impact of multi-task training with Bio-SIEVE-Multi, which adds tasks such as PICO extraction and exclusion reasoning, but find that it cannot match the performance of single-task Bio-SIEVE. We see Bio-SIEVE as an important step towards specialising LLMs for the biomedical systematic review process and explore opportunities for its future development. We release our models, code and a list of DOIs to reconstruct our dataset for reproducibility.

research
12/18/2022

Neural Rankers for Effective Screening Prioritisation in Medical Systematic Review Literature Search

Medical systematic reviews typically require assessing all the documents...
research
08/22/2019

Viability of machine learning to reduce workload in systematic review screenings in the health sciences: a working paper

Systematic reviews, which summarize and synthesize all the current resea...
research
07/12/2023

Assessing the Ability of ChatGPT to Screen Articles for Systematic Reviews

By organizing knowledge within a research field, Systematic Reviews (SR)...
research
09/11/2023

Generating Natural Language Queries for More Effective Systematic Review Screening Prioritisation

Screening prioritisation in medical systematic reviews aims to rank the ...
research
05/01/2023

Automated Paper Screening for Clinical Reviews Using Large Language Models

Objective: To assess the performance of the OpenAI GPT API in accurately...
research
01/29/2018

Improving Active Learning in Systematic Reviews

Systematic reviews are essential to summarizing the results of different...
research
01/19/2022

Automation of Citation Screening for Systematic Literature Reviews using Neural Networks: A Replicability Study

In the process of Systematic Literature Review, citation screening is es...
