Explaining Model Confidence Using Counterfactuals

03/10/2023
by   Thao Le, et al.
0

Displaying confidence scores in human-AI interaction has been shown to help build trust between humans and AI systems. However, most existing research uses only the confidence score as a form of communication. As confidence scores are just another model output, users may want to understand why the algorithm is confident to determine whether to accept the confidence score. In this paper, we show that counterfactual explanations of confidence scores help study participants to better understand and better trust a machine learning model's prediction. We present two methods for understanding model confidence using counterfactual explanation: (1) based on counterfactual examples; and (2) based on visualisation of the counterfactual space. Both increase understanding and trust for study participants over a baseline of no explanation, but qualitative results show that they are used quite differently, leading to recommendations of when to use each one and directions of designing better explanations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/06/2022

Improving Model Understanding and Trust with Counterfactual Explanations of Model Confidence

In this paper, we show that counterfactual explanations of confidence sc...
research
04/14/2021

To Trust or Not to Trust a Regressor: Estimating and Explaining Trustworthiness of Regression Predictions

In hybrid human-AI systems, users need to decide whether or not to trust...
research
06/15/2020

Explaining reputation assessments

Reputation is crucial to enabling human or software agents to select amo...
research
03/16/2023

Explaining Groups of Instances Counterfactually for XAI: A Use Case, Algorithm and User Study for Group-Counterfactuals

Counterfactual explanations are an increasingly popular form of post hoc...
research
09/15/2021

Voter Perceptions of Trust in Risk-Limiting Audits

Risk-limiting audits (RLAs) are expected to strengthen the public confid...
research
10/30/2021

On Quantitative Evaluations of Counterfactuals

As counterfactual examples become increasingly popular for explaining de...
research
04/02/2023

The Effect of Counterfactuals on Reading Chest X-rays

This study evaluates the effect of counterfactual explanations on the in...

Please sign up or login with your details

Forgot password? Click here to reset