Semi-self-supervised Automated ICD Coding

05/20/2022
by Hlynur D. Hlynsson, et al.

Clinical Text Notes (CTNs) record physicians' reasoning, written as unstructured free text, as they examine and interview patients. In recent years, several studies have provided evidence that machine learning is useful for predicting doctors' diagnoses from CTNs, a task known as ICD coding. Data annotation is time-consuming, particularly when it requires specialist knowledge, as is the case for medical data. This paper presents a method of augmenting a sparsely annotated dataset of Icelandic CTNs with machine-learned imputation in a semi-self-supervised manner. We train a neural network on a small set of annotated CTNs and use it to extract clinical features from a set of un-annotated CTNs. These clinical features consist of answers to about a thousand potential questions that a physician might answer during a consultation with a patient. The features are then used to train a classifier for the diagnosis of certain types of diseases. We report the results of an evaluation of this data augmentation method over three tiers of data availability to the physician. Our data augmentation method shows a significant positive effect, which diminishes when clinical features from the examination of the patient and diagnostics are made available. We recommend our method for augmenting scarce datasets for systems that make decisions based on clinical features that do not include examinations or tests.
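The pipeline described in the abstract (fit a feature extractor on a small annotated set, impute clinical features for un-annotated notes, then train a diagnosis classifier on the augmented set) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The synthetic data, the per-feature logistic regression standing in for the neural network, and every function name are assumptions made for illustration. The sketch also assumes that un-annotated notes still carry a diagnosis label and lack only the clinical-feature annotations.

```python
import math
import random

def sigmoid(z):
    # Clamp the input so math.exp cannot overflow.
    z = max(-30.0, min(30.0, z))
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(X, y, epochs=200, lr=0.5):
    """Binary logistic regression trained with batch gradient descent."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        gw, gb = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j in range(d):
                gw[j] += err * xi[j]
            gb += err
        w = [wj - lr * g / n for wj, g in zip(w, gw)]
        b -= lr * gb / n
    return w, b

def predict_proba(model, x):
    w, b = model
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)

def fit_extractor(X_text, F):
    """One logistic model per clinical feature (hypothetical stand-in for the neural network)."""
    return [fit_logistic(X_text, [row[k] for row in F]) for k in range(len(F[0]))]

def impute_features(extractor, X_text):
    """Predict a 0/1 answer for every clinical feature of every note."""
    return [[1 if predict_proba(m, x) >= 0.5 else 0 for m in extractor] for x in X_text]

def make_toy_data(n, rng):
    """Synthetic notes: 6 binary token indicators, 3 clinical features, 1 diagnosis."""
    X, F, y = [], [], []
    for _ in range(n):
        x = [rng.randint(0, 1) for _ in range(6)]
        f = [x[0], x[2], x[4]]  # toy assumption: each feature is readable off one token
        X.append(x)
        F.append(f)
        y.append(1 if sum(f) >= 2 else 0)  # toy assumption: diagnosis = majority of features
    return X, F, y

rng = random.Random(0)
X_small, F_small, y_small = make_toy_data(30, rng)   # small annotated set
X_unann, _, y_unann = make_toy_data(300, rng)        # feature annotations withheld

# Step 1: learn to extract clinical features from the annotated notes.
extractor = fit_extractor(X_small, F_small)
# Step 2: impute clinical features for the un-annotated notes.
F_imputed = impute_features(extractor, X_unann)
# Step 3: train the diagnosis classifier on annotated plus imputed features.
diagnosis_model = fit_logistic(F_small + F_imputed, y_small + y_unann)

# Evaluate on held-out notes using their true clinical features.
X_test, F_test, y_test = make_toy_data(200, rng)
pred = [1 if predict_proba(diagnosis_model, f) >= 0.5 else 0 for f in F_test]
accuracy = sum(p == t for p, t in zip(pred, y_test)) / len(y_test)
```

In this toy setting the imputed rows simply enlarge the training set for the diagnosis classifier. The paper's features number about a thousand and come from real Icelandic CTNs, so the sketch conveys only the shape of the data flow, not expected performance.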


