A Unified View of Label Shift Estimation

03/17/2020
by Saurabh Garg, et al.

Label shift describes the setting where, although the label distribution might change between the source and target domains, the class-conditional probabilities (of data given a label) do not. There are two dominant approaches for estimating the target label marginal. BBSE, a moment-matching approach based on confusion matrices, is provably consistent and provides interpretable error bounds. However, a maximum likelihood estimation approach, which we call MLLS, dominates empirically. In this paper, we present a unified view of the two methods and the first theoretical characterization of the likelihood-based estimator. Our contributions include (i) conditions for consistency of MLLS, which include calibration of the classifier and a confusion matrix invertibility condition that BBSE also requires; (ii) a unified view of the methods, casting the confusion matrix approach as roughly equivalent to MLLS for a particular choice of calibration method; and (iii) a decomposition of MLLS's finite-sample error into terms reflecting the impacts of miscalibration and estimation error. Our analysis attributes BBSE's statistical inefficiency to a loss of information due to coarse calibration. We support our findings with experiments on both synthetic data and the MNIST and CIFAR10 image recognition datasets.
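
To make the two estimators in the abstract concrete, here is a minimal sketch on a toy problem. It is an illustration under stated assumptions, not the paper's implementation: the function names `bbse` and `mlls_em`, the discrete toy distributions, and the use of a plain EM loop to maximize the likelihood are all choices made for this example. The toy classifier is the exact source posterior, so it is calibrated by construction.

```python
import numpy as np

def bbse(src_pred, src_true, tgt_pred, k):
    """Moment matching with a confusion matrix (hypothetical helper name)."""
    # Joint confusion matrix on labeled source data: C[i, j] ~ P_s(yhat=i, y=j)
    C = np.zeros((k, k))
    np.add.at(C, (src_pred, src_true), 1.0)
    C /= len(src_true)
    # Distribution of hard predictions on unlabeled target data
    mu = np.bincount(tgt_pred, minlength=k) / len(tgt_pred)
    # Solve C w = mu for w[y] = p_t(y) / p_s(y); requires C to be invertible
    w = np.linalg.solve(C, mu)
    p_s = np.bincount(src_true, minlength=k) / len(src_true)
    return w * p_s

def mlls_em(tgt_probs, p_s, n_iter=200):
    """Maximum likelihood estimate of the target label marginal via EM.

    tgt_probs[i, y] is the (assumed calibrated) source posterior p_s(y | x_i)
    evaluated on target point x_i.
    """
    q = p_s.copy()
    for _ in range(n_iter):
        # E-step: reweight source posteriors by the current ratio q / p_s
        r = tgt_probs * (q / p_s)
        r /= r.sum(axis=1, keepdims=True)
        # M-step: new marginal is the average responsibility
        q = r.mean(axis=0)
    return q

# Toy setup (assumed for illustration): feature x in {0, 1, 2}, two classes.
p_x_given_y = np.array([[0.7, 0.2, 0.1],
                        [0.1, 0.3, 0.6]])   # rows: y, columns: x
p_s = np.array([0.5, 0.5])                  # source label marginal
p_t = np.array([0.2, 0.8])                  # target label marginal (to recover)

# Exact source posterior p_s(y | x), shape (3, 2)
joint_s = p_x_given_y * p_s[:, None]
post = (joint_s / joint_s.sum(axis=0)).T

# Datasets of 100 points whose empirical frequencies match the populations
src_counts = np.rint(joint_s * 100).astype(int)
src_true = np.repeat(np.arange(2), src_counts.sum(axis=1))
src_x = np.concatenate([np.repeat(np.arange(3), src_counts[y]) for y in range(2)])
tgt_counts = np.rint((p_t @ p_x_given_y) * 100).astype(int)
tgt_x = np.repeat(np.arange(3), tgt_counts)

hard = post.argmax(axis=1)                  # hard prediction for each x
bbse_est = bbse(hard[src_x], src_true, hard[tgt_x], k=2)
mlls_est = mlls_em(post[tgt_x], p_s)
print("BBSE:", bbse_est, "MLLS:", mlls_est)  # both should be close to p_t
```

BBSE only sees the hard predictions, which here collapse three feature values into two bins, while the likelihood-based estimator uses the full predicted probabilities. In this noise-free toy both recover the target marginal; with finite random samples the two would generally differ, and the BBSE solve can return negative entries.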

research
05/30/2023

ELSA: Efficient Label Shift Adaptation through the Lens of Semiparametric Models

We study the domain adaptation problem with label shift in this work. Un...
research
12/29/2017

Finite-sample risk bounds for maximum likelihood estimation with arbitrary penalties

The MDL two-part coding index of resolvability provides a finite-sampl...
research
10/13/2022

A Consistent and Differentiable Lp Canonical Calibration Error Estimator

Calibrated probabilistic classifiers are models whose predicted probabil...
research
06/07/2023

Label Shift Quantification with Robustness Guarantees via Distribution Feature Matching

Quantification learning deals with the task of estimating the target lab...
research
06/25/2023

TCE: A Test-Based Approach to Measuring Calibration Error

This paper proposes a new metric to measure the calibration error of pro...
research
05/18/2023

Minimum-Risk Recalibration of Classifiers

Recalibrating probabilistic classifiers is vital for enhancing the relia...
research
07/09/2023

Doubly Flexible Estimation under Label Shift

In studies ranging from clinical medicine to policy research, complete d...
