Detecting Adversarial Samples Using Density Ratio Estimates

05/05/2017
by   Lovedeep Gondara, et al.
0

Machine learning models, especially based on deep architectures are used in everyday applications ranging from self driving cars to medical diagnostics. It has been shown that such models are dangerously susceptible to adversarial samples, indistinguishable from real samples to human eye, adversarial samples lead to incorrect classifications with high confidence. Impact of adversarial samples is far-reaching and their efficient detection remains an open problem. We propose to use direct density ratio estimation as an efficient model agnostic measure to detect adversarial samples. Our proposed method works equally well with single and multi-channel samples, and with different adversarial sample generation methods. We also propose a method to use density ratio estimates for generating adversarial samples with an added constraint of preserving density ratio.

READ FULL TEXT
research
05/05/2021

Attack-agnostic Adversarial Detection on Medical Data Using Explainable Machine Learning

Explainable machine learning has become increasingly prevalent, especial...
research
07/09/2021

GGT: Graph-Guided Testing for Adversarial Sample Detection of Deep Neural Network

Deep Neural Networks (DNN) are known to be vulnerable to adversarial sam...
research
11/18/2019

Optimal Single-Choice Prophet Inequalities from Samples

We study the single-choice Prophet Inequality problem when the gambler i...
research
10/23/2022

Falsehoods that ML researchers believe about OOD detection

An intuitive way to detect out-of-distribution (OOD) data is via the den...
research
03/01/2017

Detecting Adversarial Samples from Artifacts

Deep neural networks (DNNs) are powerful nonlinear architectures that ar...
research
07/10/2017

Towards Crafting Text Adversarial Samples

Adversarial samples are strategically modified samples, which are crafte...
research
04/23/2021

Lightweight Detection of Out-of-Distribution and Adversarial Samples via Channel Mean Discrepancy

Detecting out-of-distribution (OOD) and adversarial samples is essential...

Please sign up or login with your details

Forgot password? Click here to reset