FaceGuard: A Self-Supervised Defense Against Adversarial Face Images

11/28/2020

∙

Prevailing defense mechanisms against adversarial face images tend to overfit to the adversarial perturbations in the training set and fail to generalize to unseen adversarial attacks. We propose a new self-supervised adversarial defense framework, namely FaceGuard, that can automatically detect, localize, and purify a wide variety of adversarial faces without utilizing pre-computed adversarial training samples. During training, FaceGuard automatically synthesizes challenging and diverse adversarial attacks, enabling a classifier to learn to distinguish them from real faces and a purifier attempts to remove the adversarial perturbations in the image space. Experimental results on LFW dataset show that FaceGuard can achieve 99.81 adversarial attack types. In addition, the proposed method can enhance the face recognition performance of ArcFace from 34.27 to 77.46

READ FULL TEXT

FaceGuard: A Self-Supervised Defense Against Adversarial Face Images

Sign in with Google

Consider DeepAI Pro