Automated tabulation of clinical trial results: A joint entity and relation extraction approach with transformer-based language representations

12/10/2021
by   Jetsun Whitton, et al.
18

Evidence-based medicine, the practice in which healthcare professionals refer to the best available evidence when making decisions, forms the foundation of modern healthcare. However, it relies on labour-intensive systematic reviews, where domain specialists must aggregate and extract information from thousands of publications, primarily of randomised controlled trial (RCT) results, into evidence tables. This paper investigates automating evidence table generation by decomposing the problem across two language processing tasks: named entity recognition, which identifies key entities within text, such as drug names, and relation extraction, which maps their relationships for separating them into ordered tuples. We focus on the automatic tabulation of sentences from published RCT abstracts that report the results of the study outcomes. Two deep neural net models were developed as part of a joint extraction pipeline, using the principles of transfer learning and transformer-based language representations. To train and test these models, a new gold-standard corpus was developed, comprising almost 600 result sentences from six disease areas. This approach demonstrated significant advantages, with our system performing well across multiple natural language processing tasks and disease areas, as well as in generalising to disease domains unseen during training. Furthermore, we show these results were achievable through training our models on as few as 200 example sentences. The final system is a proof of concept that the generation of evidence tables can be semi-automated, representing a step towards fully automating systematic reviews.

READ FULL TEXT

page 2

page 6

page 11

page 12

page 18

page 19

page 20

research
08/21/2019

Fine-tuning BERT for Joint Entity and Relation Extraction in Chinese Medical Text

Entity and relation extraction is the necessary step in structuring medi...
research
11/18/2019

Drug Repurposing for Cancer: An NLP Approach to Identify Low-Cost Therapies

More than 200 generic drugs approved by the U.S. Food and Drug Administr...
research
05/06/2023

Beyond Rule-based Named Entity Recognition and Relation Extraction for Process Model Generation from Natural Language Text

Automated generation of business process models from natural language te...
research
12/19/2022

Enriching Relation Extraction with OpenIE

Relation extraction (RE) is a sub-discipline of information extraction (...
research
01/30/2020

Data Mining in Clinical Trial Text: Transformers for Classification and Question Answering Tasks

This research on data extraction methods applies recent advances in natu...
research
09/17/2015

Extraction of evidence tables from abstracts of randomized clinical trials using a maximum entropy classifier and global constraints

Systematic use of the published results of randomized clinical trials is...

Please sign up or login with your details

Forgot password? Click here to reset