Towards Adversarially Robust Deepfake Detection: An Ensemble Approach

02/11/2022
by   Ashish Hooda, et al.
1

Detecting deepfakes is an important problem, but recent work has shown that DNN-based deepfake detectors are brittle against adversarial deepfakes, in which an adversary adds imperceptible perturbations to a deepfake to evade detection. In this work, we show that a modification to the detection strategy in which we replace a single classifier with a carefully chosen ensemble, in which input transformations for each model in the ensemble induces pairwise orthogonal gradients, can significantly improve robustness beyond the de facto solution of adversarial training. We present theoretical results to show that such orthogonal gradients can help thwart a first-order adversary by reducing the dimensionality of the input subspace in which adversarial deepfakes lie. We validate the results empirically by instantiating and evaluating a randomized version of such "orthogonal" ensembles for adversarial deepfake detection and find that these randomized ensembles exhibit significantly higher robustness as deepfake detectors compared to state-of-the-art deepfake detectors against adversarial deepfakes, even those created using strong PGD-500 attacks.

READ FULL TEXT
research
03/04/2023

Improved Robustness Against Adaptive Attacks With Ensembles and Error-Correcting Output Codes

Neural network ensembles have been studied extensively in the context of...
research
04/21/2020

Certifying Joint Adversarial Robustness for Model Ensembles

Deep Neural Networks (DNNs) are often vulnerable to adversarial examples...
research
11/22/2018

Strength in Numbers: Trading-off Robustness and Computation via Adversarially-Trained Ensembles

While deep learning has led to remarkable results on a number of challen...
research
06/12/2019

A Stratified Approach to Robustness for Randomly Smoothed Classifiers

Strong theoretical guarantees of robustness can be given for ensembles o...
research
06/05/2023

Input gradient diversity for neural network ensembles

Deep Ensembles (DEs) demonstrate improved accuracy, calibration and robu...
research
06/15/2022

Evaluating object detector ensembles for improving the robustness of artifact detection in endoscopic video streams

In this contribution we use an ensemble deep-learning method for combini...
research
02/02/2023

On the Robustness of Randomized Ensembles to Adversarial Perturbations

Randomized ensemble classifiers (RECs), where one classifier is randomly...

Please sign up or login with your details

Forgot password? Click here to reset