Black-Box Ripper: Copying black-box models using generative evolutionary algorithms

10/21/2020
by   Antonio Barbalau, et al.
0

We study the task of replicating the functionality of black-box neural models, for which we only know the output class probabilities provided for a set of input images. We assume back-propagation through the black-box model is not possible and its training images are not available, e.g. the model could be exposed only through an API. In this context, we present a teacher-student framework that can distill the black-box (teacher) model into a student model with minimal accuracy loss. To generate useful data samples for training the student, our framework (i) learns to generate images on a proxy data set (with images and classes different from those used to train the black-box) and (ii) applies an evolutionary strategy to make sure that each generated data sample exhibits a high response for a specific class when given as input to the black box. Our framework is compared with several baseline and state-of-the-art methods on three benchmark data sets. The empirical evidence indicates that our model is superior to the considered baselines. Although our method does not back-propagate through the black-box network, it generally surpasses state-of-the-art methods that regard the teacher as a glass-box model. Our code is available at: https://github.com/antoniobarbalau/black-box-ripper.

READ FULL TEXT
research
07/25/2022

Black-box Few-shot Knowledge Distillation

Knowledge distillation (KD) is an efficient approach to transfer the kno...
research
07/27/2022

Look Closer to Your Enemy: Learning to Attack via Teacher-student Mimicking

This paper aims to generate realistic attack samples of person re-identi...
research
06/01/2022

On the Perils of Cascading Robust Classifiers

Ensembling certifiably robust neural networks has been shown to be a pro...
research
04/24/2023

On the Challenges of Using Black-Box APIs for Toxicity Evaluation in Research

Perception of toxicity evolves over time and often differs between geogr...
research
06/06/2020

A Generic and Model-Agnostic Exemplar Synthetization Framework for Explainable AI

With the growing complexity of deep learning methods adopted in practica...
research
02/08/2023

Adversarial Prompting for Black Box Foundation Models

Prompting interfaces allow users to quickly adjust the output of generat...
research
06/22/2018

xGEMs: Generating Examplars to Explain Black-Box Models

This work proposes xGEMs or manifold guided exemplars, a framework to un...

Please sign up or login with your details

Forgot password? Click here to reset