DeepTag: inferring all-cause diagnoses from clinical notes in under-resourced medical domain

06/28/2018
by   Allen Nie, et al.
0

In many under-resourced settings, clinicians lack time and expertise to annotate patients with standard medical diagnosis codes. Veterinary medicine is an example of this and clinical encounters are largely captured in free text notes which are not labeled with diagnosis code. The lack of such standard coding makes it challenging to apply data science to improve patient care. It is also a major impediment to translational research, where, for example, we would like to leverage veterinary data to inform drug development for humans. We develop a deep learning algorithm, DeepTag, to automatically infer diagnosis codes from veterinarian free text notes. DeepTag is trained on a newly curated dataset of 112,558 veterinary notes manually annotated by experts. DeepTag extends multi-task LSTM with an improved hierarchical objective that captures structures between diseases. To foster human-machine collaboration, DeepTag also learns to abstain in examples when it is uncertain and defer them to human experts, resulting in improved performance of the model. DeepTag accurately infers disease codes from free text even in challenging out-of-domain settings where the text comes from different clinics than the ones used for training. It enables automated disease annotation across a broad range of clinical diagnoses with minimal pre-processing. The technical framework in this work can be applied in other medical domains that currently lack medical coding infrastructure.

READ FULL TEXT

page 1

page 3

research
08/24/2021

Identification of Pediatric Respiratory Diseases Using Fine-grained Diagnosis System

Respiratory diseases, including asthma, bronchitis, pneumonia, and upper...
research
08/15/2018

Deep EHR: Chronic Disease Prediction Using Medical Notes

Early detection of preventable diseases is important for better disease ...
research
07/14/2022

GrabQC: Graph based Query Contextualization for automated ICD coding

Automated medical coding is a process of codifying clinical notes to app...
research
05/20/2022

Semi-self-supervised Automated ICD Coding

Clinical Text Notes (CTNs) contain physicians' reasoning process, writte...
research
02/27/2021

Lifelong Learning based Disease Diagnosis on Clinical Notes

Current deep learning based disease diagnosis systems usually fall short...
research
11/29/2018

Large-scale Generative Modeling to Improve Automated Veterinary Disease Coding

Supervised learning is limited both by the quantity and quality of the l...
research
12/16/2020

Ensemble model for pre-discharge icd10 coding prediction

The translation of medical diagnosis to clinical coding has wide range o...

Please sign up or login with your details

Forgot password? Click here to reset