Global explanations for discovering bias in data

05/05/2020
by   Agnieszka Mikołajczyk, et al.
12

In the paper, we propose attention-based summarized post-hoc explanations for detection and identification of bias in data. We propose a global explanation and introduce a step-by-step framework on how to detect and test bias. Then, the bias is evaluated with a proposed counterfactual approach to bias insertion. Because removing the unwanted bias is often a complicated and tremendous task, we automatically insert it, instead. We validate our results on the example of the skin lesion dataset. Using the method, we successfully identified and confirmed part of the possible bias-causing artifacts in dermoscopy images. We confirmed that the commonplace black frames in the training dataset images have a strong influence on the Convolutional Neural Network's prediction. After artificially adding a black frame to all images, around 22 shown that bias detection is an important step of making more robust models, and we discuss how to improve them

READ FULL TEXT

page 4

page 5

page 6

research
12/10/2020

Debiased-CAM for bias-agnostic faithful visual explanations of deep convolutional networks

Class activation maps (CAMs) explain convolutional neural network predic...
research
12/10/2020

Investigating Bias in Image Classification using Model Explanations

We evaluated whether model explanations could efficiently detect bias in...
research
07/22/2019

The Dangers of Post-hoc Interpretability: Unjustified Counterfactual Explanations

Post-hoc interpretability approaches have been proven to be powerful too...
research
08/10/2023

Test-Time Selection for Robust Skin Lesion Analysis

Skin lesion analysis models are biased by artifacts placed during image ...
research
09/09/2021

IFBiD: Inference-Free Bias Detection

This paper is the first to explore an automatic way to detect bias in de...
research
08/18/2023

Data augmentation and explainability for bias discovery and mitigation in deep learning

This dissertation explores the impact of bias in deep neural networks an...
research
05/05/2020

Contextualizing Hate Speech Classifiers with Post-hoc Explanation

Hate speech classifiers trained on imbalanced datasets struggle to deter...

Please sign up or login with your details

Forgot password? Click here to reset