DeepAI AI Chat
Log In Sign Up

PMC-Patients: A Large-scale Dataset of Patient Notes and Relations Extracted from Case Reports in PubMed Central

02/28/2022
by   Zhengyun Zhao, et al.
Tsinghua University
13

We present PMC-Patients, a dataset consisting of 167k patient notes with 3.1M relevant article annotations and 293k similar patient annotations. The patient notes are extracted by identifying certain sections from case reports in PubMed Central, and those with at least CC BY-NC-SA license are re-distributed. Patient-article relevance and patient-patient similarity are defined by citation relationships in PubMed. We also perform four tasks with PMC-Patients to demonstrate its utility, including Patient Note Recognition (PNR), Patient-Patient Similarity (PPS), Patient-Patient Retrieval (PPR), and Patient-Article Retrieval (PAR). In summary, PMC-Patients provides the largest-scale patient notes with high quality, diverse conditions, easy access, and rich annotations.

READ FULL TEXT

page 2

page 4

page 5

page 8

05/04/2020

Generating SOAP Notes from Doctor-Patient Conversations

Following each patient visit, physicians must draft detailed clinical su...
08/08/2018

Learning to Write Notes in Electronic Health Records

Clinicians spend a significant amount of time inputting free-form textua...
02/22/2021

Distributed Application of Guideline-Based Decision Support through Mobile Devices: Implementation and Evaluation

Traditionally Guideline(GL)based Decision Support Systems (DSSs) use a c...
05/12/2022

ScAN: Suicide Attempt and Ideation Events Dataset

Suicide is an important public health concern and one of the leading cau...
04/09/2020

Query-Focused EHR Summarization to Aid Imaging Diagnosis

Electronic Health Records (EHRs) provide vital contextual information to...
06/29/2022

Why patient data cannot be easily forgotten?

Rights provisioned within data protection regulations, permit patients t...

Code Repositories