Defense Against Adversarial Images using Web-Scale Nearest-Neighbor Search

03/05/2019
by   Abhimanyu Dubey, et al.
0

A plethora of recent work has shown that convolutional networks are not robust to adversarial images: images that are created by perturbing a sample from the data distribution as to maximize the loss on the perturbed example. In this work, we hypothesize that adversarial perturbations move the image away from the image manifold in the sense that there exists no physical process that could have produced the adversarial image. This hypothesis suggests that a successful defense mechanism against adversarial images should aim to project the images back onto the image manifold. We study such defense mechanisms, which approximate the projection onto the unknown image manifold by a nearest-neighbor search against a web-scale image database containing tens of billions of images. Empirical evaluations of this defense strategy on ImageNet suggest that it is very effective in attack settings in which the adversary does not have access to the image database. We also propose two novel attack methods to break nearest-neighbor defenses, and demonstrate conditions under which nearest-neighbor defense fails. We perform a series of ablation experiments, which suggest that there is a trade-off between robustness and accuracy in our defenses, that a large image database (with hundreds of millions of images) is crucial to get good performance, and that careful construction the image database is important to be robust against attacks tailored to circumvent our defenses.

READ FULL TEXT

page 2

page 3

research
03/20/2019

On the Robustness of Deep K-Nearest Neighbors

Despite a large amount of attention on adversarial examples, very few wo...
research
06/27/2021

ASK: Adversarial Soft k-Nearest Neighbor Attack and Defense

K-Nearest Neighbor (kNN)-based deep learning methods have been applied t...
research
12/18/2020

RAILS: A Robust Adversarial Immune-inspired Learning System

Adversarial attacks against deep neural networks are continuously evolvi...
research
12/11/2022

DISCO: Adversarial Defense with Local Implicit Functions

The problem of adversarial defenses for image classification, where the ...
research
10/31/2017

Countering Adversarial Images using Input Transformations

This paper investigates strategies that defend against adversarial-examp...
research
11/21/2017

Manifold Assumption and Defenses Against Adversarial Perturbations

In the adversarial perturbation problem of neural networks, an adversary...
research
04/22/2021

Robust Certification for Laplace Learning on Geometric Graphs

Graph Laplacian (GL)-based semi-supervised learning is one of the most u...

Please sign up or login with your details

Forgot password? Click here to reset