Exemplar Auditing for Multi-Label Biomedical Text Classification

04/07/2020
by   Allen Schmaltz, et al.
0

Many practical applications of AI in medicine consist of semi-supervised discovery: The investigator aims to identify features of interest at a resolution more fine-grained than that of the available human labels. This is often the scenario faced in healthcare applications as coarse, high-level labels (e.g., billing codes) are often the only sources that are readily available. These challenges are compounded for modalities such as text, where the feature space is very high-dimensional, and often contains considerable amounts of noise. In this work, we generalize a recently proposed zero-shot sequence labeling method, "binary labeling via a convolutional decomposition", to the case where the available document-level human labels are themselves relatively high-dimensional. The approach yields classification with "introspection", relating the fine-grained features of an inference-time prediction to their nearest neighbors from the training set, under the model. The approach is effective, yet parsimonious, as demonstrated on a well-studied MIMIC-III multi-label classification task of electronic health record data, and is useful as a tool for organizing the analysis of neural model predictions and high-dimensional datasets. Our proposed approach yields both a competitively effective classification model and an interrogation mechanism to aid healthcare workers in understanding the salient features that drive the model's predictions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/29/2020

Seeing The Whole Patient: Using Multi-Label Medical Text Classification Techniques to Enhance Predictions of Medical Codes

Machine learning-based multi-label medical text classifications can be u...
research
09/28/2019

Generalized Zero-shot ICD Coding

The International Classification of Diseases (ICD) is a list of classifi...
research
12/19/2020

Towards Coarse and Fine-grained Multi-Graph Multi-Label Learning

Multi-graph multi-label learning (Mgml) is a supervised learning framewo...
research
04/27/2020

GraftNet: An Engineering Implementation of CNN for Fine-grained Multi-label Task

Multi-label networks with branches are proved to perform well in both ac...
research
06/10/2019

Label-Agnostic Sequence Labeling by Copying Nearest Neighbors

Retrieve-and-edit based approaches to structured prediction, where struc...
research
05/01/2017

Regularizing Model Complexity and Label Structure for Multi-Label Text Classification

Multi-label text classification is a popular machine learning task where...

Please sign up or login with your details

Forgot password? Click here to reset