Radioactive data: tracing through training

02/03/2020
by   Alexandre Sablayrolles, et al.
2

We want to detect whether a particular image dataset has been used to train a model. We propose a new technique, radioactive data, that makes imperceptible changes to this dataset such that any model trained on it will bear an identifiable mark. The mark is robust to strong variations such as different architectures or optimization methods. Given a trained model, our technique detects the use of radioactive data and provides a level of confidence (p-value). Our experiments on large-scale benchmarks (Imagenet), using standard architectures (Resnet-18, VGG-16, Densenet-121) and training procedures, show that we can detect usage of radioactive data with high confidence (p<10^-4) even when only 1 radioactive. Our method is robust to data augmentation and the stochasticity of deep network optimization. As a result, it offers a much higher signal-to-noise ratio than data poisoning and backdoor methods.

READ FULL TEXT
research
09/17/2018

Déjà Vu: an empirical evaluation of the memorization properties of ConvNets

Convolutional neural networks memorize part of their training data, whic...
research
11/01/2018

The Natural Auditor: How To Tell If Someone Used Your Words To Train Their Model

To help enforce data-protection regulations such as GDPR and detect unau...
research
01/21/2021

DataLoc+: A Data Augmentation Technique for Machine Learning in Room-Level Indoor Localization

Indoor localization has been a hot area of research over the past two de...
research
04/11/2018

KS(conf ): A Light-Weight Test if a ConvNet Operates Outside of Its Specifications

Computer vision systems for automatic image categorization have become a...
research
05/05/2021

Rethinking Ultrasound Augmentation: A Physics-Inspired Approach

Medical Ultrasound (US), despite its wide use, is characterized by artif...
research
07/04/2022

A Robust Ensemble Model for Patasitic Egg Detection and Classification

Intestinal parasitic infections, as a leading causes of morbidity worldw...

Please sign up or login with your details

Forgot password? Click here to reset