FaceGuard: A Self-Supervised Defense Against Adversarial Face Images

11/28/2020
by   Debayan Deb, et al.
41

Prevailing defense mechanisms against adversarial face images tend to overfit to the adversarial perturbations in the training set and fail to generalize to unseen adversarial attacks. We propose a new self-supervised adversarial defense framework, namely FaceGuard, that can automatically detect, localize, and purify a wide variety of adversarial faces without utilizing pre-computed adversarial training samples. During training, FaceGuard automatically synthesizes challenging and diverse adversarial attacks, enabling a classifier to learn to distinguish them from real faces and a purifier attempts to remove the adversarial perturbations in the image space. Experimental results on LFW dataset show that FaceGuard can achieve 99.81 adversarial attack types. In addition, the proposed method can enhance the face recognition performance of ArcFace from 34.27 to 77.46

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset