Model Reduction of Shallow CNN Model for Reliable Deployment of Information Extraction from Medical Reports

07/31/2020
by   Abhishek K Dubey, et al.
0

Shallow Convolution Neural Network (CNN) is a time-tested tool for the information extraction from cancer pathology reports. Shallow CNN performs competitively on this task to other deep learning models including BERT, which holds the state-of-the-art for many NLP tasks. The main insight behind this eccentric phenomenon is that the information extraction from cancer pathology reports require only a small number of domain-specific text segments to perform the task, thus making the most of the texts and contexts excessive for the task. Shallow CNN model is well-suited to identify these key short text segments from the labeled training set; however, the identified text segments remain obscure to humans. In this study, we fill this gap by developing a model reduction tool to make a reliable connection between CNN filters and relevant text segments by discarding the spurious connections. We reduce the complexity of shallow CNN representation by approximating it with a linear transformation of n-gram presence representation with a non-negativity and sparsity prior on the transformation weights to obtain an interpretable model. Our approach bridge the gap between the conventionally perceived trade-off boundary between accuracy on the one side and explainability on the other by model reduction.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/05/2021

Integration of Domain Knowledge using Medical Knowledge Graph Deep Learning for Cancer Phenotyping

A key component of deep learning (DL) for natural language processing (N...
research
12/06/2018

Pathology Extraction from Chest X-Ray Radiology Reports: A Performance Study

Extraction of relevant pathological terms from radiology reports is impo...
research
11/13/2018

Predicting Distresses using Deep Learning of Text Segments in Annual Reports

Corporate distress models typically only employ the numerical financial ...
research
09/25/2022

Application of Deep Learning in Generating Structured Radiology Reports: A Transformer-Based Technique

Since radiology reports needed for clinical practice and research are wr...
research
03/02/2023

Learning From Yourself: A Self-Distillation Method for Fake Speech Detection

In this paper, we propose a novel self-distillation method for fake spee...
research
09/21/2023

Precision in Building Extraction: Comparing Shallow and Deep Models using LiDAR Data

Building segmentation is essential in infrastructure development, popula...
research
09/10/2020

Why I'm not Answering

Safe deployment of deep learning systems in critical real world applicat...

Please sign up or login with your details

Forgot password? Click here to reset