D-Score: A White-Box Diagnosis Score for CNNs Based on Mutation Operators

04/03/2023
by   Xin Zhang, et al.
0

Convolutional neural networks (CNNs) have been widely applied in many safety-critical domains, such as autonomous driving and medical diagnosis. However, concerns have been raised with respect to the trustworthiness of these models: The standard testing method evaluates the performance of a model on a test set, while low-quality and insufficient test sets can lead to unreliable evaluation results, which can have unforeseeable consequences. Therefore, how to comprehensively evaluate CNNs and, based on the evaluation results, how to enhance their trustworthiness are the key problems to be urgently addressed. Prior work has used mutation tests to evaluate the test sets of CNNs. However, the evaluation scores are black boxes and not explicit enough for what is being tested. In this paper, we propose a white-box diagnostic approach that uses mutation operators and image transformation to calculate the feature and attention distribution of the model and further present a diagnosis score, namely D-Score, to reflect the model's robustness and fitness to a dataset. We also propose a D-Score based data augmentation method to enhance the CNN's performance to translations and rescalings. Comprehensive experiments on two widely used datasets and three commonly adopted CNNs demonstrate the effectiveness of our approach.

READ FULL TEXT

page 1

page 4

research
02/28/2020

Design of Mutation Operators for Testing Geographic Information Systems

In this article, we propose the definition of specific mutation operator...
research
11/11/2020

Dealing with Robustness of Convolutional Neural Networks for Image Classification

SW-based systems depend more and more on AI also for critical tasks. For...
research
08/02/2023

TEASMA: A Practical Approach for the Test Assessment of Deep Neural Networks using Mutation Analysis

Successful deployment of Deep Neural Networks (DNNs), particularly in sa...
research
01/21/2023

Towards a Measure of Trustworthiness to Evaluate CNNs During Operation

Due to black box nature of Convolutional neural networks (CNNs), the con...
research
08/11/2022

A Probabilistic Framework for Mutation Testing in Deep Neural Networks

Context: Mutation Testing (MT) is an important tool in traditional Softw...
research
06/08/2021

White Paper Assistance: A Step Forward Beyond the Shortcut Learning

The promising performances of CNNs often overshadow the need to examine ...

Please sign up or login with your details

Forgot password? Click here to reset