Evaluating Out-of-Distribution Detectors Through Adversarial Generation of Outliers

08/20/2022
by Sangwoong Yoon, et al.

A reliable evaluation method is essential for building a robust out-of-distribution (OOD) detector. Current robustness evaluation protocols for OOD detectors rely on injecting perturbations into outlier data. However, such perturbations are unlikely to occur naturally or are irrelevant to the content of the data, and so provide only a limited assessment of robustness. In this paper, we propose Evaluation-via-Generation for OOD detectors (EvG), a new protocol for investigating the robustness of OOD detectors under more realistic modes of variation in outliers. EvG uses a generative model to synthesize plausible outliers and employs MCMC sampling to find the outliers that a detector misclassifies as in-distribution with the highest confidence. We perform a comprehensive benchmark comparison of state-of-the-art OOD detectors using EvG, uncovering previously overlooked weaknesses.
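
To make the search concrete, here is a minimal sketch of one way such a procedure could look: a random-walk Metropolis sampler moves through the latent space of a generator and favors latent codes whose generated outliers receive a high in-distribution score from a detector. This is not the paper's implementation. The names `toy_generator`, `toy_detector_score`, and `mh_search`, the choice of random-walk Metropolis as the MCMC method, and the `step_size` and `temperature` values are all assumptions made for illustration.

```python
import numpy as np

def toy_generator(z):
    """Hypothetical stand-in for a generative model of outliers: latent z -> sample x."""
    return 3.0 * np.tanh(z)

def toy_detector_score(x):
    """Hypothetical detector score; higher means 'looks more in-distribution'.
    In-distribution data is assumed to cluster around the point (2, 2)."""
    return -np.sum((x - 2.0) ** 2)

def mh_search(generator, score, dim=2, n_steps=2000,
              step_size=0.3, temperature=0.1, seed=0):
    """Random-walk Metropolis over latent codes (an assumed choice of MCMC).
    Target density is proportional to exp(score(generator(z)) / temperature)."""
    rng = np.random.default_rng(seed)
    z = rng.standard_normal(dim)
    s = score(generator(z))
    best_z, best_s = z.copy(), s
    for _ in range(n_steps):
        z_new = z + step_size * rng.standard_normal(dim)  # symmetric proposal
        s_new = score(generator(z_new))
        # Accept with probability min(1, exp((s_new - s) / temperature)).
        if np.log(rng.uniform()) < (s_new - s) / temperature:
            z, s = z_new, s_new
            if s > best_s:
                best_z, best_s = z.copy(), s
    # Return the generated outlier the detector is most confident is in-distribution.
    return generator(best_z), best_s

worst_x, worst_score = mh_search(toy_generator, toy_detector_score)
```

In this toy setup, `worst_x` is the generated sample that the detector scores as most in-distribution, which is the kind of worst-case outlier the protocol is meant to surface. A real evaluation would replace both toy functions with a trained generative model and the OOD detector under test.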


