Data Shapley Value for Handling Noisy Labels: An application in Screening COVID-19 Pneumonia from Chest CT Scans

10/17/2021
by   Nastaran Enshaei, et al.
0

A long-standing challenge of deep learning models involves how to handle noisy labels, especially in applications where human lives are at stake. Adoption of the data Shapley Value (SV), a cooperative game theoretical approach, is an intelligent valuation solution to tackle the issue of noisy labels. Data SV can be used together with a learning model and an evaluation metric to validate each training point's contribution to the model's performance. The SV of a data point, however, is not unique and depends on the learning model, the evaluation metric, and other data points collaborating in the training game. However, effects of utilizing different evaluation metrics for computation of the SV, detecting the noisy labels, and measuring the data points' importance has not yet been thoroughly investigated. In this context, we performed a series of comparative analyses to assess SV's capabilities to detect noisy input labels when measured by different evaluation metrics. Our experiments on COVID-19-infected of CT images illustrate that although the data SV can effectively identify noisy labels, adoption of different evaluation metric can significantly influence its ability to identify noisy labels from different data classes. Specifically, we demonstrate that the SV greatly depends on the associated evaluation metric.

READ FULL TEXT
research
08/09/2022

Res-Dense Net for 3D Covid Chest CT-scan classification

One of the most contentious areas of research in Medical Image Preproces...
research
02/08/2018

A Semi-Supervised Two-Stage Approach to Learning from Noisy Labels

The recent success of deep neural networks is powered in part by large-s...
research
02/20/2023

Towards Unbounded Machine Unlearning

Deep machine unlearning is the problem of removing the influence of a co...
research
01/25/2022

Comparison of Evaluation Metrics for Landmark Detection in CMR Images

Cardiac Magnetic Resonance (CMR) images are widely used for cardiac diag...
research
03/30/2021

Noise-resistant Deep Metric Learning with Ranking-based Instance Selection

The existence of noisy labels in real-world data negatively impacts the ...
research
03/28/2021

Friends and Foes in Learning from Noisy Labels

Learning from examples with noisy labels has attracted increasing attent...
research
07/18/2018

Is it worth it? Budget-related evaluation metrics for model selection

Creating a linguistic resource is often done by using a machine learning...

Please sign up or login with your details

Forgot password? Click here to reset