Assessing Systematic Weaknesses of DNNs using Counterfactuals

08/03/2023
by   Sujan Sai Gannamaneni, et al.
0

With the advancement of DNNs into safety-critical applications, testing approaches for such models have gained more attention. A current direction is the search for and identification of systematic weaknesses that put safety assumptions based on average performance values at risk. Such weaknesses can take on the form of (semantically coherent) subsets or areas in the input space where a DNN performs systematically worse than its expected average. However, it is non-trivial to attribute the reason for such observed low performances to the specific semantic features that describe the subset. For instance, inhomogeneities within the data w.r.t. other (non-considered) attributes might distort results. However, taking into account all (available) attributes and their interaction is often computationally highly expensive. Inspired by counterfactual explanations, we propose an effective and computationally cheap algorithm to validate the semantic attribution of existing subsets, i.e., to check whether the identified attribute is likely to have caused the degraded performance. We demonstrate this approach on an example from the autonomous driving domain using highly annotated simulated data, where we show for a semantic segmentation model that (i) performance differences among the different pedestrian assets exist, but (ii) only in some cases is the asset type itself the reason for this reduction in the performance.

READ FULL TEXT
research
01/03/2023

Benchmarking the Robustness of LiDAR Semantic Segmentation Models

When using LiDAR semantic segmentation models for safety-critical applic...
research
08/01/2022

RankAxis: Towards a Systematic Combination of Projection and Ranking in Multi-Attribute Data Exploration

Projection and ranking are frequently used analysis techniques in multi-...
research
03/04/2019

Towards Structured Evaluation of Deep Neural Network Supervisors

Deep Neural Networks (DNN) have improved the quality of several non-safe...
research
11/11/2022

A Benchmark for Out of Distribution Detection in Point Cloud 3D Semantic Segmentation

Safety-critical applications like autonomous driving use Deep Neural Net...
research
04/12/2021

Improving Online Performance Prediction for Semantic Segmentation

In this work we address the task of observing the performance of a seman...
research
05/10/2022

A Safety Assurable Human-Inspired Perception Architecture

Although artificial intelligence-based perception (AIP) using deep neura...

Please sign up or login with your details

Forgot password? Click here to reset