Regularized Learning for Domain Adaptation under Label Shifts

03/22/2019
by   Kamyar Azizzadenesheli, et al.
0

We propose Regularized Learning under Label shifts (RLLS), a principled and a practical domain-adaptation algorithm to correct for shifts in the label distribution between a source and a target domain. We first estimate importance weights using labeled source data and unlabeled target data, and then train a classifier on the weighted source samples. We derive a generalization bound for the classifier on the target domain which is independent of the (ambient) data dimensions, and instead only depends on the complexity of the function class. To the best of our knowledge, this is the first generalization bound for the label-shift problem where the labels in the target domain are not available. Based on this bound, we propose a regularized estimator for the small-sample regime which accounts for the uncertainty in the estimated weights. Experiments on the CIFAR-10 and MNIST datasets show that RLLS improves classification accuracy, especially in the low sample and large-shift regimes, compared to previous methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/29/2020

Importance Weight Estimation and Generalization in Domain Adaptation under Label Shift

We study generalization under label shift in domain adaptation where the...
research
10/23/2020

Coping with Label Shift via Distributionally Robust Optimisation

The label shift problem refers to the supervised learning setting where ...
research
02/26/2020

Understanding Self-Training for Gradual Domain Adaptation

Machine learning systems must adapt to data distributions that evolve ov...
research
02/06/2023

RLSbench: Domain Adaptation Under Relaxed Label Shift

Despite the emergence of principled methods for domain adaptation under ...
research
03/23/2020

Minimax optimal approaches to the label shift problem

We study minimax rates of convergence in the label shift problem. In add...
research
01/11/2022

Leveraging Unlabeled Data to Predict Out-of-Distribution Performance

Real-world machine learning deployments are characterized by mismatches ...
research
05/18/2023

Minimum-Risk Recalibration of Classifiers

Recalibrating probabilistic classifiers is vital for enhancing the relia...

Please sign up or login with your details

Forgot password? Click here to reset