Learning Sample Difficulty from Pre-trained Models for Reliable Prediction

04/20/2023
by   Peng Cui, et al.
0

Large-scale pre-trained models have achieved remarkable success in a variety of scenarios and applications, but how to leverage them to improve the prediction reliability of downstream models is undesirably under-explored. Moreover, modern neural networks have been found to be poorly calibrated and make overconfident predictions regardless of inherent sample difficulty and data uncertainty. To address this issue, we propose to utilize large-scale pre-trained models to guide downstream model training with sample difficulty-aware entropy regularization. Pre-trained models that have been exposed to large-scale datasets and do not overfit the downstream training classes enable us to measure each training sample difficulty via feature-space Gaussian modeling and relative Mahalanobis distance computation. Importantly, by adaptively penalizing overconfident prediction based on the sample's difficulty, we simultaneously improve accuracy and uncertainty calibration on various challenging benchmarks, consistently surpassing competitive baselines for reliable prediction.

READ FULL TEXT

page 1

page 12

page 13

page 14

page 15

research
03/09/2023

Rethinking Visual Prompt Learning as Masked Visual Token Modeling

Prompt learning has achieved great success in efficiently exploiting lar...
research
10/12/2022

Can Calibration Improve Sample Prioritization?

Calibration can reduce overconfident predictions of deep neural networks...
research
03/20/2023

Architecture, Dataset and Model-Scale Agnostic Data-free Meta-Learning

The goal of data-free meta-learning is to learn useful prior knowledge f...
research
10/24/2022

Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval

Current person image retrieval methods have achieved great improvements ...
research
08/21/2023

Towards Accelerated Model Training via Bayesian Data Selection

Mislabeled, duplicated, or biased data in real-world scenarios can lead ...
research
05/31/2023

Representation Reliability and Its Impact on Downstream Tasks

Self-supervised pre-trained models extract general-purpose representatio...
research
03/28/2023

Large-scale Pre-trained Models are Surprisingly Strong in Incremental Novel Class Discovery

Discovering novel concepts from unlabelled data and in a continuous mann...

Please sign up or login with your details

Forgot password? Click here to reset