Backdoor for Debias: Mitigating Model Bias with Backdoor Attack-based Artificial Bias

03/01/2023
by Shangxi Wu, et al.

With the rapid advancement of deep learning, state-of-the-art algorithms are now deployed in a wide range of social settings. However, some of these algorithms have been found to exhibit bias and produce unequal outcomes. Existing debiasing methods suffer from poor data utilization or complex training requirements. In this work, we find that a backdoor attack can construct an artificial bias similar to the model bias that arises in standard training. Because backdoor triggers are highly adjustable, we are motivated to mitigate model bias by carefully designing a reverse artificial bias created with a backdoor attack. Building on this idea, we propose a backdoor debiasing framework based on knowledge distillation, which effectively reduces the model bias learned from the original data and minimizes the security risks introduced by the backdoor attack. The proposed solution is validated on both image and structured datasets, showing promising results. This work advances the understanding of backdoor attacks and highlights their potential for beneficial applications. The code for the study can be found at <https://anonymous.4open.science/r/DwB-BC07/>.
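The abstract names two building blocks, a backdoor trigger and knowledge distillation, without giving details. The sketch below illustrates only those two generic pieces and is not the authors' implementation. The corner-patch trigger, the helper names `stamp_trigger` and `distillation_loss`, and the temperature value are all assumptions made for illustration. It also assumes one plausible reading of the framework: a backdoored teacher is queried on triggered inputs, and a student is trained on clean inputs to match the teacher's softened outputs, so the deployed student never needs the trigger.

```python
import math


def stamp_trigger(image, value=1.0, size=2):
    """Return a copy of a 2-D grayscale image (list of rows) with a
    size x size patch in the bottom-right corner set to `value`.
    The patch shape and location are hypothetical choices."""
    stamped = [row[:] for row in image]  # copy so the input stays clean
    height, width = len(stamped), len(stamped[0])
    for r in range(height - size, height):
        for c in range(width - size, width):
            stamped[r][c] = value
    return stamped


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over temperature-scaled logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """Standard distillation term: KL(teacher || student) on softened
    distributions, scaled by temperature squared."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl


# Usage: the teacher would see the triggered image, the student the clean one.
clean = [[0.0] * 4 for _ in range(4)]
triggered = stamp_trigger(clean)
# Hypothetical logits standing in for teacher(triggered) and student(clean).
loss = distillation_loss([2.0, 0.5], [1.0, 1.0])
```

In a real training loop the logits would come from the two networks, and this term would typically be combined with a task loss; the weighting between the two is not stated in the abstract.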


research
11/23/2021

Algorithmic Fairness in Face Morphing Attack Detection

Face morphing attacks can compromise Face Recognition System (FRS) by ex...
research
06/15/2021

Simon Says: Evaluating and Mitigating Bias in Pruned Neural Networks with Knowledge Distillation

In recent years the ubiquitous deployment of AI has posed great concerns...
research
04/20/2022

Epistemic Uncertainty-Weighted Loss for Visual Bias Mitigation

Deep neural networks are highly susceptible to learning biases in visual...
research
04/03/2022

In Rain or Shine: Understanding and Overcoming Dataset Bias for Improving Robustness Against Weather Corruptions for Autonomous Vehicles

Several popular computer vision (CV) datasets, specifically employed for...
research
11/08/2021

Robust and Information-theoretically Safe Bias Classifier against Adversarial Attacks

In this paper, the bias classifier is introduced, that is, the bias part...
research
02/07/2023

Self-Sampling Training and Evaluation for the Accuracy-Bias Tradeoff in Recommendation

Research on debiased recommendation has shown promising results. However...
research
06/21/2019

Mitigating Bias in Algorithmic Employment Screening: Evaluating Claims and Practices

There has been rapidly growing interest in the use of algorithms for emp...
