Towards Backwards-Compatible Data with Confounded Domain Adaptation

03/23/2022
by   Calvin McCarter, et al.
0

Most current domain adaptation methods address either covariate shift or label shift, but are not applicable where they occur simultaneously and are confounded with each other. Domain adaptation approaches which do account for such confounding are designed to adapt covariates to optimally predict a particular label whose shift is confounded with covariate shift. In this paper, we instead seek to achieve general-purpose data backwards compatibility. This would allow the adapted covariates to be used for a variety of downstream problems, including on pre-existing prediction models and on data analytics tasks. To do this we consider a special case of generalized label shift (GLS), which we call confounded shift. We present a novel framework for this problem, based on minimizing the expected divergence between the source and target conditional distributions, conditioning on possible confounders. Within this framework, we propose using the reverse Kullback-Leibler divergence, demonstrating the use of parametric and nonparametric Gaussian estimators of the conditional distribution. We also propose using the Maximum Mean Discrepancy (MMD), introducing a dynamic strategy for choosing the kernel bandwidth, which is applicable even outside the confounded shift setting. Finally, we demonstrate our approach on synthetic and real datasets.

READ FULL TEXT

page 12

page 26

research
10/23/2019

Generalized Domain Adaptation with Covariate and Label Shift CO-ALignment

Unsupervised knowledge transfer has a great potential to improve the gen...
research
03/05/2019

Domain Adaptation with Asymmetrically-Relaxed Distribution Alignment

Domain adaptation addresses the common problem when the target distribut...
research
06/06/2022

Class Prior Estimation under Covariate Shift – no Problem?

We show that in the context of classification the property of source and...
research
06/25/2021

Domain Conditional Predictors for Domain Adaptation

Learning guarantees often rely on assumptions of i.i.d. data, which will...
research
09/21/2018

Intractable Likelihood Regression for Covariate Shift by Kernel Mean Embedding

Simulation plays an essential role in comprehending a target system in m...
research
02/26/2022

Generalized Label Shift Correction via Minimum Uncertainty Principle: Theory and Algorithm

As a fundamental problem in machine learning, dataset shift induces a pa...
research
05/30/2023

ELSA: Efficient Label Shift Adaptation through the Lens of Semiparametric Models

We study the domain adaptation problem with label shift in this work. Un...

Please sign up or login with your details

Forgot password? Click here to reset