Multi-label natural language processing to identify diagnosis and procedure codes from MIMIC-III inpatient notes

03/17/2020
by A. K. Bhavani Singh, et al.

In the United States, 25% [...] spending accounts for administrative costs that involve services for medical coding and billing. With the increasing number of patient records, manual assignment of codes is overwhelming, time-consuming, and error-prone, causing billing errors. Natural language processing can automate the extraction of codes/labels from unstructured clinical notes, which can help human coders save time, increase productivity, and verify medical coding errors. Our objective is to identify appropriate diagnosis and procedure codes from clinical notes by performing multi-label classification. We used de-identified data of critical care patients from the MIMIC-III database and subset the data to select the ten (top-10) and fifty (top-50) most common diagnoses and procedures, which cover 47.45% [...] respectively. We implemented state-of-the-art Bidirectional Encoder Representations from Transformers (BERT) to fine-tune the language model on 80% of the data and validated on the remaining 20% [...] accuracy of 87.08% [...] codes. For the top-50 codes, our model achieved an overall accuracy of 93.76% [...] an F1 score of 92.24% [...] research, our model outperforms in predicting codes from the clinical text. We discuss approaches to generalize the knowledge discovery process of our MIMIC-BERT to other clinical notes. This can help human coders save time and prevent backlogs and additional costs due to coding errors.
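
As a rough illustration only, the data preparation the abstract describes (restricting the label space to the most common codes, encoding each note's codes as a multi-label target, holding out 20% of the data, and scoring with F1) could be sketched in plain Python as follows. The helper names, the toy records, and the placeholder codes are hypothetical, and micro-averaged F1 is an assumption, since the abstract does not say which averaging was used. The BERT fine-tuning step itself is not shown.

```python
from collections import Counter
import random


def top_k_codes(records, k):
    """Return the k codes that appear in the most records (hypothetical helper)."""
    counts = Counter(code for _, codes in records for code in set(codes))
    return [code for code, _ in counts.most_common(k)]


def to_multi_hot(codes, label_space):
    """Encode one note's codes as a 0/1 vector over the chosen label space."""
    present = set(codes)
    return [1 if code in present else 0 for code in label_space]


def split_80_20(items, seed=0):
    """Shuffle a copy of the items and split it 80/20 (assumed split procedure)."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    cut = int(0.8 * len(shuffled))
    return shuffled[:cut], shuffled[cut:]


def micro_f1(y_true, y_pred):
    """Micro-averaged F1 over all (note, label) pairs; the averaging is an assumption."""
    tp = fp = fn = 0
    for true_row, pred_row in zip(y_true, y_pred):
        for t, p in zip(true_row, pred_row):
            if t == 1 and p == 1:
                tp += 1
            elif t == 0 and p == 1:
                fp += 1
            elif t == 1 and p == 0:
                fn += 1
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Toy records: (note text, assigned codes). Placeholder codes, not real data.
records = [
    ("note 1", ["CODE_A", "CODE_B"]),
    ("note 2", ["CODE_A"]),
    ("note 3", ["CODE_A", "CODE_B", "CODE_C"]),
    ("note 4", ["CODE_C", "CODE_D"]),
    ("note 5", ["CODE_A", "CODE_B"]),
]

labels = top_k_codes(records, k=2)
targets = [to_multi_hot(codes, labels) for _, codes in records]
train, held_out = split_80_20(list(zip(records, targets)))
```

Each note can carry several codes at once, so the target is a vector with one 0/1 entry per code rather than a single class index. That is the property that makes the task multi-label rather than multi-class.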

research
12/28/2019

Natural language processing of MIMIC-III clinical notes for identifying diagnosis and procedures with neural networks

Coding diagnosis and procedures in medical records is a crucial process ...
research
06/15/2021

Medical Code Prediction from Discharge Summary: Document to Sequence BERT using Sequence Attention

Clinical notes are unstructured text generated by clinicians during pati...
research
05/26/2020

BERT-XML: Large Scale Automated ICD Coding Using BERT Pretraining

Clinical interactions are initially recorded and documented in free text...
research
03/11/2021

Does the Magic of BERT Apply to Medical Code Assignment? A Quantitative Study

Unsupervised pretraining is an integral part of many natural language pr...
research
06/24/2022

Classifying Unstructured Clinical Notes via Automatic Weak Supervision

Healthcare providers usually record detailed notes of the clinical care ...
research
09/27/2017

Multi-Label Classification of Patient Notes a Case Study on ICD Code Assignment

In the context of the Electronic Health Record, automated diagnosis codi...
research
08/15/2022

Entity Anchored ICD Coding

Medical coding is a complex task, requiring assignment of a subset of ov...
