A Corpus with Multi-Level Annotations of Patients, Interventions and Outcomes to Support Language Processing for Medical Literature

06/11/2018
by   Benjamin Nye, et al.
0

We present a corpus of 5,000 richly annotated abstracts of medical articles describing clinical randomized controlled trials. Annotations include demarcations of text spans that describe the Patient population enrolled, the Interventions studied and to what they were Compared, and the Outcomes measured (the `PICO' elements). These spans are further annotated at a more granular level, e.g., individual interventions within them are marked and mapped onto a structured medical vocabulary. We acquired annotations from a diverse set of workers with varying levels of expertise and cost. We describe our data collection process and the corpus itself in detail. We then outline a set of challenging NLP tasks that would aid searching of the medical literature and the practice of evidence-based medicine.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/05/2023

Jointly Extracting Interventions, Outcomes, and Findings from RCT Reports with LLMs

Results from Randomized Controlled Trials (RCTs) establish the comparati...
research
06/22/2016

Automated Extraction of Number of Subjects in Randomised Controlled Trials

We present a simple approach for automatically extracting the number of ...
research
10/12/2022

RedHOT: A Corpus of Annotated Medical Questions, Experiences, and Claims on Social Media

We present Reddit Health Online Talk (RedHOT), a corpus of 22,000 richly...
research
09/17/2015

Extraction of evidence tables from abstracts of randomized clinical trials using a maximum entropy classifier and global constraints

Systematic use of the published results of randomized clinical trials is...
research
07/03/2019

Clustering of Medical Free-Text Records Based on Word Embeddings

Is it true that patients with similar conditions get similar diagnoses? ...
research
04/21/2019

A Study on Agreement in PICO Span Annotations

In evidence-based medicine, relevance of medical literature is determine...
research
04/03/2023

Enhancing Clinical Evidence Recommendation with Multi-Channel Heterogeneous Learning on Evidence Graphs

Clinical evidence encompasses the associations and impacts between patie...

Please sign up or login with your details

Forgot password? Click here to reset