Shift Happens: Adjusting Classifiers

Minimizing expected loss measured by a proper scoring rule, such as Brier score or log-loss (cross-entropy), is a common objective while training a probabilistic classifier. If the data have experienced dataset shift where the class distributions change post-training, then often the model's performance will decrease, over-estimating the probabilities of some classes while under-estimating the others on average. We propose unbounded and bounded general adjustment (UGA and BGA) methods that transform all predictions to (re-)equalize the average prediction and the class distribution. These methods act differently depending on which proper scoring rule is to be minimized, and we have a theoretical guarantee of reducing loss on test data, if the exact class distribution is known. We also demonstrate experimentally that, when in practice the class distribution is known only approximately, there is often still a reduction in loss depending on the amount of shift and the precision to which the class distribution is known.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/16/2019

A new example for a proper scoring rule

We give a new example for a proper scoring rule motivated by the form of...
research
04/27/2019

Analysis of Confident-Classifiers for Out-of-distribution Detection

Discriminatively trained neural classifiers can be trusted, only when th...
research
07/15/2021

Optimal Scoring Rule Design

This paper introduces an optimization problem for proper scoring rule de...
research
07/29/2022

Factorizable Joint Shift in Multinomial Classification

Factorizable joint shift (FJS) was recently proposed as a type of datase...
research
12/19/2021

Managing dataset shift by adversarial validation for credit scoring

Dataset shift is common in credit scoring scenarios, and the inconsisten...
research
05/18/2021

Label Inference Attacks from Log-loss Scores

Log-loss (also known as cross-entropy loss) metric is ubiquitously used ...
research
01/15/2020

Generalized Bayesian Quantification Learning

Quantification Learning is the task of prevalence estimation for a test ...

Please sign up or login with your details

Forgot password? Click here to reset