Reject Illegal Inputs with Generative Classifier Derived from Any Discriminative Classifier

01/02/2020
by   Xin Wang, et al.
0

Generative classifiers have been shown promising to detect illegal inputs including adversarial examples and out-of-distribution samples. Supervised Deep Infomax (SDIM) is a scalable end-to-end framework to learn generative classifiers. In this paper, we propose a modification of SDIM termed SDIM-logit. Instead of training generative classifier from scratch, SDIM-logit first takes as input the logits produced any given discriminative classifier, and generate logit representations; then a generative classifier is derived by imposing statistical constraints on logit representations. SDIM-logit could inherit the performance of the discriminative classifier without loss. SDIM-logit incurs a negligible number of additional parameters, and can be efficiently trained with base classifiers fixed. We perform classification with rejection, where test samples whose class conditionals are smaller than pre-chosen thresholds will be rejected without predictions. Experiments on illegal inputs, including adversarial examples, samples with common corruptions, and out-of-distribution (OOD) samples show that allowed to reject a portion of test samples, SDIM-logit significantly improves the performance on the left test sets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/28/2019

Are Odds Really Odd? Bypassing Statistical Detection of Adversarial Examples

Deep learning classifiers are known to be vulnerable to adversarial exam...
research
05/21/2018

Generative Adversarial Examples

Adversarial examples are typically constructed by perturbing an existing...
research
04/25/2017

Introspective Classifier Learning: Empower Generatively

In this paper we propose introspective classifier learning (ICL) that em...
research
02/06/2020

Neural Network Representation Control: Gaussian Isolation Machines and CVC Regularization

In many cases, neural network classifiers are likely to be exposed to in...
research
10/04/2022

Distance Based Image Classification: A solution to generative classification's conundrum?

Most classifiers rely on discriminative boundaries that separate instanc...
research
10/07/2013

Discriminative Features via Generalized Eigenvectors

Representing examples in a way that is compatible with the underlying cl...
research
06/15/2023

Training Diffusion Classifiers with Denoising Assistance

Score-matching and diffusion models have emerged as state-of-the-art gen...

Please sign up or login with your details

Forgot password? Click here to reset