Learned Feature Attribution Priors

12/20/2019
by   Ethan Weinberger, et al.
2

Deep learning models have achieved breakthrough successes in domains where data is plentiful. However, such models are prone to overfitting when trained on high-dimensional, low sample size datasets. Furthermore, the black-box nature of such models has limited their application in domains where model trust is critical. As a result, deep learning has struggled to make inroads in domains such as precision medicine, where small sample sizes are the norm and model trust is paramount. Oftentimes, even in low data settings we have some set of prior information on each input feature to our prediction task, which may be related to that feature's relevance to the prediction problem. In this work we propose the learned attribution prior framework to take advantage of such information and alleviate the issues mentioned previously. For a given prediction task, our framework jointly learns a relationship between prior information about a feature and that feature's importance to the task, while also biasing the prediction model to focus on the features with high predicted importance. We find that training models using our framework improves model accuracy in low-data settings. Furthermore, we find that the resulting learned meta-feature to feature relationships open up new avenues for model interpretation.

READ FULL TEXT

page 5

page 6

research
06/25/2019

Learning Explainable Models Using Attribution Priors

Two important topics in deep learning both involve incorporating humans ...
research
07/05/2023

Harmonizing Feature Attributions Across Deep Learning Architectures: Enhancing Interpretability and Consistency

Ensuring the trustworthiness and interpretability of machine learning mo...
research
11/03/2022

Analysis of a Deep Learning Model for 12-Lead ECG Classification Reveals Learned Features Similar to Diagnostic Criteria

Despite their remarkable performance, deep neural networks remain unadop...
research
06/08/2023

Sound Explanation for Trustworthy Machine Learning

We take a formal approach to the explainability problem of machine learn...
research
04/01/2021

Explaining COVID-19 and Thoracic Pathology Model Predictions by Identifying Informative Input Features

Neural networks have demonstrated remarkable performance in classificati...
research
06/19/2019

Incorporating Priors with Feature Attribution on Text Classification

Feature attribution methods, proposed recently, help users interpret the...
research
03/07/2017

Regularising Non-linear Models Using Feature Side-information

Very often features come with their own vectorial descriptions which pro...

Please sign up or login with your details

Forgot password? Click here to reset