Uncertainty-Aware Attention for Reliable Interpretation and Prediction

05/24/2018
by   Jay Heo, et al.
0

Attention mechanism is effective in both focusing the deep learning models on relevant features and interpreting them. However, attentions may be unreliable since the networks that generate them are often trained in a weakly-supervised manner. To overcome this limitation, we introduce the notion of input-dependent uncertainty to the attention mechanism, such that it generates attention for each feature with varying degrees of noise based on the given input, to learn larger variance on instances it is uncertain about. We learn this Uncertainty-aware Attention (UA) mechanism using variational inference, and validate it on various risk prediction tasks from electronic health records on which our model significantly outperforms existing attention models. The analysis of the learned attentions shows that our model generates attentions that comply with clinicians' interpretation, and provide richer interpretation via learned variance. Further evaluation of both the accuracy of the uncertainty calibration and the prediction performance with "I don't know" decision show that UA yields networks with high reliability as well.

READ FULL TEXT
research
01/31/2022

Interpretable and Generalizable Graph Learning via Stochastic Attention Mechanism

Interpretable graph learning is in need as many scientific applications ...
research
07/14/2019

Modeling the Uncertainty in Electronic Health Records: a Bayesian Deep Learning Approach

Deep learning models have exhibited superior performance in predictive t...
research
10/22/2020

UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced Data

Successful health risk prediction demands accuracy and reliability of th...
research
08/22/2023

Uncertainty Estimation of Transformers' Predictions via Topological Analysis of the Attention Matrices

Determining the degree of confidence of deep learning model in its predi...
research
12/22/2019

Hierarchical Target-Attentive Diagnosis Prediction in Heterogeneous Information Networks

We introduce HTAD, a novel model for diagnosis prediction using Electron...
research
11/26/2021

TDAN: Top-Down Attention Networks for Enhanced Feature Selectivity in CNNs

Attention modules for Convolutional Neural Networks (CNNs) are an effect...
research
08/23/2019

A comparative study for interpreting deep learning prediction of the Parkinson's disease diagnosis from SPECT imaging

The application of deep learning to single-photon emission computed tomo...

Please sign up or login with your details

Forgot password? Click here to reset