Active learning for medical code assignment

04/12/2021
by   Martha Dais Ferreira, et al.
0

Machine Learning (ML) is widely used to automatically extract meaningful information from Electronic Health Records (EHR) to support operational, clinical, and financial decision-making. However, ML models require a large number of annotated examples to provide satisfactory results, which is not possible in most healthcare scenarios due to the high cost of clinician-labeled data. Active Learning (AL) is a process of selecting the most informative instances to be labeled by an expert to further train a supervised algorithm. We demonstrate the effectiveness of AL in multi-label text classification in the clinical domain. In this context, we apply a set of well-known AL methods to help automatically assign ICD-9 codes on the MIMIC-III dataset. Our results show that the selection of informative instances provides satisfactory classification with a significantly reduced training set (8.3% of the total instances). We conclude that AL methods can significantly reduce the manual annotation cost while preserving model performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/24/2023

Active Learning for Natural Language Generation

The field of text generation suffers from a severe shortage of labeled d...
research
07/08/2023

Active Learning in Physics: From 101, to Progress, and Perspective

Active Learning (AL) is a family of machine learning (ML) algorithms tha...
research
06/13/2021

Active Learning for Network Traffic Classification: A Technical Study

Network Traffic Classification (NTC) has become an important feature in ...
research
11/21/2018

Robust Active Learning for Electrocardiographic Signal Classification

The classification of electrocardiographic (ECG) signals is a challengin...
research
09/09/2021

Cartography Active Learning

We propose Cartography Active Learning (CAL), a novel Active Learning (A...
research
04/12/2023

Does Informativeness Matter? Active Learning for Educational Dialogue Act Classification

Dialogue Acts (DAs) can be used to explain what expert tutors do and wha...
research
04/26/2017

On Using Active Learning and Self-Training when Mining Performance Discussions on Stack Overflow

Abundant data is the key to successful machine learning. However, superv...

Please sign up or login with your details

Forgot password? Click here to reset