Towards Trustworthy Dataset Distillation

07/18/2023
by   Shijie Ma, et al.
0

Efficiency and trustworthiness are two eternal pursuits when applying deep learning in real-world applications. With regard to efficiency, dataset distillation (DD) endeavors to reduce training costs by distilling the large dataset into a tiny synthetic dataset. However, existing methods merely concentrate on in-distribution (InD) classification in a closed-world setting, disregarding out-of-distribution (OOD) samples. On the other hand, OOD detection aims to enhance models' trustworthiness, which is always inefficiently achieved in full-data settings. For the first time, we simultaneously consider both issues and propose a novel paradigm called Trustworthy Dataset Distillation (TrustDD). By distilling both InD samples and outliers, the condensed datasets are capable to train models competent in both InD classification and OOD detection. To alleviate the requirement of real outlier data and make OOD detection more practical, we further propose to corrupt InD samples to generate pseudo-outliers and introduce Pseudo-Outlier Exposure (POE). Comprehensive experiments on various settings demonstrate the effectiveness of TrustDD, and the proposed POE surpasses state-of-the-art method Outlier Exposure (OE). Compared with the preceding DD, TrustDD is more trustworthy and applicable to real open-world scenarios. Our code will be publicly available.

READ FULL TEXT

page 8

page 12

page 13

page 16

page 17

page 18

page 19

page 20

research
06/28/2022

POEM: Out-of-Distribution Detection with Posterior Sampling

Out-of-distribution (OOD) detection is indispensable for machine learnin...
research
03/30/2023

OpenMix: Exploring Outlier Samples for Misclassification Detection

Reliable confidence estimation for deep neural classifiers is a challeng...
research
03/22/2023

AUTO: Adaptive Outlier Optimization for Online Test-Time OOD Detection

Out-of-distribution (OOD) detection is a crucial aspect of deploying mac...
research
07/18/2023

Pseudo Outlier Exposure for Out-of-Distribution Detection using Pretrained Transformers

For real-world language applications, detecting an out-of-distribution (...
research
09/13/2018

Does Your Model Know the Digit 6 Is Not a Cat? A Less Biased Evaluation of "Outlier" Detectors

In the real world, a learning system could receive an input that looks n...
research
06/07/2023

Learning with Noisy Labels by Adaptive Gradient-Based Outlier Removal

An accurate and substantial dataset is necessary to train a reliable and...
research
05/27/2020

An Entropy Based Outlier Score and its Application to Novelty Detection for Road Infrastructure Images

A novel unsupervised outlier score, which can be embedded into graph bas...

Please sign up or login with your details

Forgot password? Click here to reset