Automated Medical Coding on MIMIC-III and MIMIC-IV: A Critical Review and Replicability Study

04/21/2023
by   Joakim Edin, et al.
0

Medical coding is the task of assigning medical codes to clinical free-text documentation. Healthcare professionals manually assign such codes to track patient diagnoses and treatments. Automated medical coding can considerably alleviate this administrative burden. In this paper, we reproduce, compare, and analyze state-of-the-art automated medical coding machine learning models. We show that several models underperform due to weak configurations, poorly sampled train-test splits, and insufficient evaluation. In previous work, the macro F1 score has been calculated sub-optimally, and our correction doubles it. We contribute a revised model comparison using stratified sampling and identical experimental setups, including hyperparameters and decision boundary tuning. We analyze prediction errors to validate and falsify assumptions of previous works. The analysis confirms that all models struggle with rare codes, while long documents only have a negligible impact. Finally, we present the first comprehensive results on the newly released MIMIC-IV dataset using the reproduced models. We release our code, model parameters, and new MIMIC-III and MIMIC-IV training and evaluation pipelines to accommodate fair future comparisons.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/27/2023

Mimic-IV-ICD: A new benchmark for eXtreme MultiLabel Classification

Clinical notes are assigned ICD codes - sets of codes for diagnoses and ...
research
07/29/2020

Predicting Multiple ICD-10 Codes from Brazilian-Portuguese Clinical Notes

ICD coding from electronic clinical records is a manual, time-consuming ...
research
07/07/2023

MDACE: MIMIC Documents Annotated with Code Evidence

We introduce a dataset for evidence/rationale extraction on an extreme m...
research
06/29/2021

Few-Shot Electronic Health Record Coding through Graph Contrastive Learning

Electronic health record (EHR) coding is the task of assigning ICD codes...
research
06/12/2020

Experimental Evaluation and Development of a Silver-Standard for the MIMIC-III Clinical Coding Dataset

Clinical coding is currently a labour-intensive, error-prone, but critic...
research
11/05/2018

Medical code prediction with multi-view convolution and description-regularized label-dependent attention

A ubiquitous task in processing electronic medical data is the assignmen...
research
03/04/2022

AutoMap: Automatic Medical Code Mapping for Clinical Prediction Model Deployment

Given a deep learning model trained on data from a source site, how to d...

Please sign up or login with your details

Forgot password? Click here to reset