A cryptographic approach to black box adversarial machine learning

06/07/2019
by   Kevin Shi, et al.
2

We propose an ensemble technique for converting any classifier into a computationally secure classifier. We define a simpler security problem for random binary classifiers and prove a reduction from this model to the security of the overall ensemble classifier. We provide experimental evidence of the security of our random binary classifiers, as well as empirical results of the adversarial accuracy of the overall ensemble to black-box attacks. Our construction crucially leverages hidden randomness in the multiclass-to-binary reduction.

READ FULL TEXT

page 6

page 7

research
10/13/2022

A Logic of "Black Box" Classifier Systems

Binary classifiers are traditionally studied by propositional logic (PL)...
research
10/16/2020

Embedding and Synthesis of Knowledge in Tree Ensemble Classifiers

This paper studies the embedding and synthesis of knowledge in tree ense...
research
08/23/2023

Ensembling Uncertainty Measures to Improve Safety of Black-Box Classifiers

Machine Learning (ML) algorithms that perform classification may predict...
research
04/14/2022

Planting Undetectable Backdoors in Machine Learning Models

Given the computational cost and technical expertise required to train m...
research
03/23/2017

Data Driven Exploratory Attacks on Black Box Classifiers in Adversarial Domains

While modern day web applications aim to create impact at the civilizati...
research
06/11/2018

Learning to Speed Up Structured Output Prediction

Predicting structured outputs can be computationally onerous due to the ...
research
06/01/2022

Discovering the Hidden Vocabulary of DALLE-2

We discover that DALLE-2 seems to have a hidden vocabulary that can be u...

Please sign up or login with your details

Forgot password? Click here to reset