An Information-theoretical Approach to Semi-supervised Learning under Covariate-shift

02/24/2022
by   Gholamali Aminian, et al.

A common assumption in semi-supervised learning is that the labeled, unlabeled, and test data are drawn from the same distribution. However, this assumption does not hold in many applications. In many scenarios, the data is collected sequentially (e.g., in healthcare) and its distribution may change over time, often exhibiting so-called covariate shift. In this paper, we propose an approach for semi-supervised learning algorithms that is capable of addressing this issue. Our framework also recovers some popular methods, including entropy minimization and pseudo-labeling. We provide new information-theoretic upper bounds on the generalization error, inspired by our novel framework. Our bounds are applicable to both general semi-supervised learning and the covariate-shift scenario. Finally, we show numerically that our method outperforms previous approaches proposed for semi-supervised learning under covariate shift.
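For readers unfamiliar with the two methods named in the abstract, the following is a minimal sketch of the standard textbook forms of the entropy-minimization and pseudo-labeling losses on unlabeled data. It is not the framework proposed in the paper, and the function names (`softmax`, `entropy_loss`, `pseudo_label_loss`) and the use of NumPy are assumptions made for illustration only.

```python
import numpy as np

def softmax(logits):
    # Row-wise softmax, shifted by the row maximum for numerical stability.
    z = logits - logits.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def entropy_loss(probs, eps=1e-12):
    # Entropy minimization: mean Shannon entropy of the predicted
    # class distributions on unlabeled inputs (hypothetical helper).
    return float(-(probs * np.log(probs + eps)).sum(axis=1).mean())

def pseudo_label_loss(probs, eps=1e-12):
    # Pseudo-labeling: cross-entropy against the model's own argmax
    # prediction, i.e. -log of the largest predicted probability.
    return float(-np.log(probs.max(axis=1) + eps).mean())

# Illustrative logits for three unlabeled points and two classes.
unlabeled_logits = np.array([[0.0, 0.0], [2.0, -1.0], [5.0, -5.0]])
probs = softmax(unlabeled_logits)
print("entropy loss:     ", entropy_loss(probs))
print("pseudo-label loss:", pseudo_label_loss(probs))
```

In practice either term is added, with a weighting coefficient, to the supervised loss on the labeled data. Both terms push the classifier toward confident predictions on unlabeled inputs, and for any predicted distribution the pseudo-label loss is never larger than the entropy.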


research
06/27/2012

On Causal and Anticausal Learning

We consider the problem of function estimation in the case where an unde...
research
05/04/2022

Estimation of prediction error with known covariate shift

In supervised learning, the estimation of prediction error on unlabeled ...
research
12/12/2011

Robust Learning via Cause-Effect Models

We consider the problem of function estimation in the case where the dat...
research
06/07/2023

Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications

In many machine learning systems that jointly learn from multiple modali...
research
09/05/2017

Discriminative Similarity for Clustering and Semi-Supervised Learning

Similarity-based clustering and semi-supervised learning methods separat...
research
06/23/2016

Semi-supervised Inference: General Theory and Estimation of Means

We propose a general semi-supervised inference framework focused on the ...
research
07/13/2017

On Measuring and Quantifying Performance: Error Rates, Surrogate Loss, and an Example in SSL

In various approaches to learning, notably in domain adaptation, active ...
