Label Shift Quantification with Robustness Guarantees via Distribution Feature Matching

06/07/2023
by   Bastien Dussap, et al.
0

Quantification learning deals with the task of estimating the target label distribution under label shift. In this paper, we first present a unifying framework, distribution feature matching (DFM), that recovers as particular instances various estimators introduced in previous literature. We derive a general performance bound for DFM procedures, improving in several key aspects upon previous bounds derived in particular cases. We then extend this analysis to study robustness of DFM procedures in the misspecified setting under departure from the exact label shift hypothesis, in particular in the case of contamination of the target by an unknown distribution. These theoretical findings are confirmed by a detailed numerical study on simulated and real-world datasets. We also introduce an efficient, scalable and robust version of kernel-based DFM using the Random Fourier Feature principle.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/09/2021

Online Adaptation to Label Distribution Shift

Machine learning models often encounter distribution shifts when deploye...
research
07/11/2018

Quantification under prior probability shift: the ratio estimator and its extensions

The quantification problem consists of determining the prevalence of a g...
research
12/28/2017

Kernel Robust Bias-Aware Prediction under Covariate Shift

Under covariate shift, training (source) data and testing (target) data ...
research
03/04/2021

Distribution-free uncertainty quantification for classification under label shift

Trustworthy deployment of ML models requires a proper measure of uncerta...
research
03/17/2020

A Unified View of Label Shift Estimation

Label shift describes the setting where although the label distribution ...
research
05/26/2022

Undersampling is a Minimax Optimal Robustness Intervention in Nonparametric Classification

While a broad range of techniques have been proposed to tackle distribut...
research
11/07/2022

A Semiparametric Efficient Approach To Label Shift Estimation and Quantification

Transfer Learning is an area of statistics and machine learning research...

Please sign up or login with your details

Forgot password? Click here to reset