Rethinking Backdoor Data Poisoning Attacks in the Context of Semi-Supervised Learning

12/05/2022
by   Marissa Connor, et al.
0

Semi-supervised learning methods can train high-accuracy machine learning models with a fraction of the labeled training samples required for traditional supervised learning. Such methods do not typically involve close review of the unlabeled training samples, making them tempting targets for data poisoning attacks. In this paper we investigate the vulnerabilities of semi-supervised learning methods to backdoor data poisoning attacks on the unlabeled samples. We show that simple poisoning attacks that influence the distribution of the poisoned samples' predicted labels are highly effective - achieving an average attack success rate as high as 96.9 framework targeting semi-supervised learning methods to better understand and exploit their limitations and to motivate future defense strategies.

READ FULL TEXT

page 8

page 10

page 14

page 16

research
05/04/2021

Poisoning the Unlabeled Dataset of Semi-Supervised Learning

Semi-supervised machine learning models learn from a (small) set of labe...
research
06/30/2019

On the Sample Complexity of HGR Maximal Correlation Functions

The Hirschfeld-Gebelein-Rényi (HGR) maximal correlation and the correspo...
research
06/10/2020

A Probabilistic Framework for Discriminative and Neuro-Symbolic Semi-Supervised Learning

In semi-supervised learning (SSL), a rule to predict labels y for data x...
research
09/01/2022

Attack Tactic Identification by Transfer Learning of Language Model

Cybersecurity has become a primary global concern with the rapid increas...
research
08/28/2020

Semi-supervised Learning with the EM Algorithm: A Comparative Study between Unstructured and Structured Prediction

Semi-supervised learning aims to learn prediction models from both label...
research
03/13/2020

Minor Constraint Disturbances for Deep Semi-supervised Learning

In high-dimensional data space, semi-supervised feature learning based o...
research
12/14/2018

Semi-Supervised Monaural Singing Voice Separation With a Masking Network Trained on Synthetic Mixtures

We study the problem of semi-supervised singing voice separation, in whi...

Please sign up or login with your details

Forgot password? Click here to reset