General regularization in covariate shift adaptation

07/21/2023
by Duc Hoan Nguyen, et al.

Sample reweighting is one of the most widely used methods for correcting the error that least squares learning algorithms in reproducing kernel Hilbert spaces (RKHS) incur when future data distributions differ from the training data distribution. In practical situations, the sample weights are determined by the values of the estimated Radon-Nikodým derivative of the future data distribution w.r.t. the training data distribution. In this work, we review known error bounds for reweighted kernel regression in RKHS and combine them to obtain novel results. We show that, under weak smoothness conditions, the number of samples needed to achieve the same order of accuracy as in standard supervised learning without differences in data distributions is smaller than state-of-the-art analyses have proven.
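
To make the reweighting idea concrete, the following is a minimal sketch of importance-weighted kernel ridge regression, not the method analyzed in the paper. It assumes Tikhonov regularization as one particular instance of a regularization scheme, a Gaussian kernel, and a Radon-Nikodým derivative that is known in closed form (a ratio of two Gaussian densities) rather than estimated from data. All function names, distributions, and parameter values (`fit_weighted_krr`, `lam`, `bandwidth`, the target function) are hypothetical and chosen only for illustration.

```python
import numpy as np

def gaussian_kernel(a, b, bandwidth=0.5):
    # Pairwise Gaussian kernel between 1-D arrays a (n,) and b (m,) -> (n, m).
    diff = a[:, None] - b[None, :]
    return np.exp(-diff ** 2 / (2 * bandwidth ** 2))

def gaussian_pdf(x, mean, std):
    # Density of a univariate normal distribution.
    return np.exp(-0.5 * ((x - mean) / std) ** 2) / (std * np.sqrt(2 * np.pi))

def fit_weighted_krr(x, y, weights, lam=1e-3, bandwidth=0.5):
    # Minimizes (1/n) * sum_i w_i * (f(x_i) - y_i)^2 + lam * ||f||^2 over the RKHS.
    # With f = sum_j alpha_j k(x_j, .), the coefficients solve
    # (W K + n * lam * I) alpha = W y, where W = diag(weights).
    n = len(x)
    K = gaussian_kernel(x, x, bandwidth)
    A = weights[:, None] * K + n * lam * np.eye(n)
    return np.linalg.solve(A, weights * y)

def predict(x_train, alpha, x_new, bandwidth=0.5):
    # Evaluates the fitted function at new inputs.
    return gaussian_kernel(x_new, x_train, bandwidth) @ alpha

# Assumed toy setting: training inputs ~ N(0, 1), future inputs ~ N(1, 0.5^2).
rng = np.random.default_rng(0)
x_train = rng.normal(0.0, 1.0, 200)
y_train = np.sin(2 * x_train) + 0.1 * rng.normal(size=200)

# Sample weights: the density ratio (future / training) at the training inputs.
# Here it is known exactly; in practice it would have to be estimated.
weights = gaussian_pdf(x_train, 1.0, 0.5) / gaussian_pdf(x_train, 0.0, 1.0)

alpha = fit_weighted_krr(x_train, y_train, weights)
x_future = rng.normal(1.0, 0.5, 50)
y_future_hat = predict(x_train, alpha, x_future)
```

Setting all weights to one recovers ordinary kernel ridge regression, which makes it easy to compare the reweighted and unweighted fits on inputs drawn from the future distribution.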


research
09/02/2022

Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems

One of the key challenges of learning an online recommendation model is ...
research
05/15/2023

Double-Weighting for Covariate Shift Adaptation

Supervised learning is often affected by a covariate shift in which the ...
research
10/04/2018

Correcting the bias in least squares regression with volume-rescaled sampling

Consider linear regression where the examples are generated by an unknow...
research
05/07/2023

Classification Tree Pruning Under Covariate Shift

We consider the problem of pruning a classification tree, that is, selec...
research
08/06/2019

Semiparametric Wavelet-based JPEG IV Estimator for endogenously truncated data

A new and an enriched JPEG algorithm is provided for identifying redunda...
research
10/12/2022

How Much Data Are Augmentations Worth? An Investigation into Scaling Laws, Invariance, and Implicit Regularization

Despite the clear performance benefits of data augmentations, little is ...
research
06/07/2022

Generalized Data Distribution Iteration

To obtain higher sample efficiency and superior final performance simult...
