Modeling Diagnostic Label Correlation for Automatic ICD Coding

06/24/2021
by Shang-Chi Tsai, et al.

Predicting diagnostic codes from the clinical notes in electronic health records (EHRs) is a challenging task that is formulated as multi-label classification. The large label set, the hierarchical dependencies among codes, and the imbalanced data make this prediction extremely hard. Most existing work builds a binary predictor for each label independently, ignoring the dependencies between labels. To address this problem, we propose a two-stage framework that improves automatic ICD coding by capturing label correlation. Specifically, we train a label set distribution estimator to rescore the probability of each candidate label set generated by a base predictor. This paper is the first attempt at learning the label set distribution as a reranking module for medical code prediction. In experiments, the proposed framework improves upon the best-performing predictors on the benchmark MIMIC datasets. The source code of this project is available at https://github.com/MiuLab/ICD-Correlation.
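The two-stage idea described above (a base predictor proposes candidate label sets, and a label set estimator rescores them) can be illustrated with a minimal sketch. This is not the authors' implementation: the candidate generation scheme, the pairwise co-occurrence score standing in for a trained label set distribution estimator, the additive combination with weight `alpha`, and all function names and example values are assumptions made only for illustration.

```python
import itertools
import math


def base_log_prob(label_set, probs):
    """Log-probability of a label set if every label were independent.

    Assumes each probability lies strictly between 0 and 1.
    """
    return sum(
        math.log(p if label in label_set else 1.0 - p)
        for label, p in probs.items()
    )


def generate_candidates(probs, n_uncertain=2, top_k=4):
    """Hypothetical candidate generator: threshold at 0.5, then enumerate
    every on/off assignment of the labels closest to 0.5."""
    confident = {label for label, p in probs.items() if p >= 0.5}
    uncertain = sorted(probs, key=lambda l: abs(probs[l] - 0.5))[:n_uncertain]
    fixed = confident - set(uncertain)
    candidates = [
        frozenset(fixed | set(chosen))
        for r in range(len(uncertain) + 1)
        for chosen in itertools.combinations(uncertain, r)
    ]
    candidates.sort(key=lambda s: base_log_prob(s, probs), reverse=True)
    return candidates[:top_k]


def pairwise_correlation_score(label_set, training_sets):
    """Toy stand-in for a learned label set estimator: the sum of smoothed
    pointwise mutual information over all label pairs in the set."""
    n = len(training_sets)

    def smoothed(count):
        return (count + 0.5) / (n + 1)

    score = 0.0
    for a, b in itertools.combinations(sorted(label_set), 2):
        n_a = sum(1 for s in training_sets if a in s)
        n_b = sum(1 for s in training_sets if b in s)
        n_ab = sum(1 for s in training_sets if a in s and b in s)
        score += math.log(smoothed(n_ab) / (smoothed(n_a) * smoothed(n_b)))
    return score


def rerank(probs, training_sets, alpha=1.0):
    """Pick the candidate with the best combined score (assumed additive)."""
    return max(
        generate_candidates(probs),
        key=lambda s: base_log_prob(s, probs)
        + alpha * pairwise_correlation_score(s, training_sets),
    )


# Made-up per-label probabilities from a hypothetical base predictor.
probs = {"428.0": 0.9, "427.31": 0.45, "250.00": 0.55, "401.9": 0.2}
# Made-up training label sets used to estimate co-occurrence.
training_sets = [
    {"428.0", "427.31"},
    {"428.0", "427.31"},
    {"428.0", "427.31", "401.9"},
    {"250.00"},
    {"250.00", "401.9"},
]

best = rerank(probs, training_sets)
print(sorted(best))  # ['427.31', '428.0']
```

In this toy example, independent thresholding would output {428.0, 250.00}, while the reranked result swaps in 427.31 because it co-occurs with 428.0 in the made-up training sets. A trained estimator would replace `pairwise_correlation_score` in practice.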
