xGEMs: Generating Examplars to Explain Black-Box Models

06/22/2018
by Shalmali Joshi, et al.

This work proposes xGEMs, or manifold-guided exemplars, a framework for understanding black-box classifier behavior by exploring the landscape of the underlying data manifold as data points cross decision boundaries. To do so, we train an unsupervised implicit generative model, which is treated as a proxy for the data manifold. We summarize black-box model behavior quantitatively by perturbing data samples along the manifold. We demonstrate xGEMs' ability to detect and quantify bias in model learning and to show how model behavior changes as training progresses.
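
The idea of perturbing a sample along the manifold until it crosses a decision boundary can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: `decoder` is a hypothetical stand-in for the trained generative model (a fixed curve in 2-D), `black_box` is a hypothetical classifier that is only queried for probabilities, and the objective (cross-entropy toward a target label plus a proximity penalty weighted by `lam`) with finite-difference gradients is one plausible way to drive the search, not necessarily the authors' procedure.

```python
import numpy as np

# Hypothetical stand-in for a trained generative model: maps a 1-D latent
# code z onto a curve (the "manifold") in 2-D data space.
def decoder(z):
    return np.array([z[0], np.sin(z[0])])

# Hypothetical black-box classifier: we only query it for P(y = 1 | x).
def black_box(x):
    return 1.0 / (1.0 + np.exp(-(2.0 * x[0] - 1.0)))

def objective(z, x_orig, target, lam):
    """Cross-entropy toward the target label plus a proximity penalty."""
    p = np.clip(black_box(decoder(z)), 1e-9, 1 - 1e-9)
    ce = -(target * np.log(p) + (1 - target) * np.log(1 - p))
    return ce + lam * np.sum((decoder(z) - x_orig) ** 2)

def numeric_grad(f, z, eps=1e-5):
    """Central finite differences, so the classifier is only ever queried."""
    grad = np.zeros_like(z)
    for i in range(z.size):
        step = np.zeros_like(z)
        step[i] = eps
        grad[i] = (f(z + step) - f(z - step)) / (2 * eps)
    return grad

def find_exemplar(z0, target, lam=0.01, lr=0.1, max_steps=500):
    """Walk in latent space until the decoded point is assigned `target`."""
    x_orig = decoder(z0)
    z = z0.astype(float).copy()
    path = [z.copy()]
    for _ in range(max_steps):
        if (black_box(decoder(z)) > 0.5) == bool(target):
            break  # the decoded point has crossed the decision boundary
        z -= lr * numeric_grad(lambda v: objective(v, x_orig, target, lam), z)
        path.append(z.copy())
    return decoder(z), path

z_start = np.array([-1.0])  # latent code of a sample classified as 0
exemplar, path = find_exemplar(z_start, target=1)
```

Because every candidate point is produced by `decoder`, the search never leaves the toy manifold. In this sketch, `len(path) - 1` counts the latent steps needed to flip the label, which is one simple way a per-sample quantity could be summarized across a dataset.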


research
09/22/2008

Modeling and Control with Local Linearizing Nadaraya Watson Regression

Black box models of technical systems are purely descriptive. They do no...
research
10/21/2020

Black-Box Ripper: Copying black-box models using generative evolutionary algorithms

We study the task of replicating the functionality of black-box neural m...
research
02/08/2023

Adversarial Prompting for Black Box Foundation Models

Prompting interfaces allow users to quickly adjust the output of generat...
research
11/30/2021

Black box tests for algorithmic stability

Algorithmic stability is a concept from learning theory that expresses t...
research
11/07/2022

Proper losses for discrete generative models

We initiate the study of proper losses for evaluating generative models ...
research
06/16/2020

Sample-Efficient Optimization in the Latent Space of Deep Generative Models via Weighted Retraining

Many important problems in science and engineering, such as drug design,...
research
04/22/2021

Patch Shortcuts: Interpretable Proxy Models Efficiently Find Black-Box Vulnerabilities

An important pillar for safe machine learning (ML) is the systematic mit...
