Covariate shift in nonparametric regression with Markovian design

07/17/2023
by   Lukas Trottner, et al.

Covariate shift in regression problems and the associated distribution mismatch between training and test data is a commonly encountered phenomenon in machine learning. In this paper, we extend recent results on nonparametric convergence rates for i.i.d. data to Markovian dependence structures. We demonstrate that under Hölder smoothness assumptions on the regression function, convergence rates for the generalization risk of a Nadaraya-Watson kernel estimator are determined by the similarity between the invariant distributions associated with the source and target Markov chains. The similarity is explicitly captured in terms of a bandwidth-dependent similarity measure recently introduced in Pathak, Ma and Wainwright [ICML, 2022]. Precise convergence rates are derived for the particular cases of finite Markov chains and spectral gap Markov chains for which the similarity measure between their invariant distributions grows polynomially with decreasing bandwidth. For the latter, we extend the notion of a distribution transfer exponent from Kpotufe and Martinet [Ann. Stat., 49(6), 2021] to kernel transfer exponents of uniformly ergodic Markov chains in order to generate a rich class of Markov kernel pairs for which convergence guarantees for the covariate shift problem can be formulated.
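To make the setting concrete, the following is a minimal illustrative sketch, not the paper's code or experimental setup. It fits a Nadaraya-Watson estimator with a box kernel on covariates drawn from a source Markov chain and evaluates it on covariates from a target chain with a different invariant distribution. The AR(1) chains, the regression function `np.sin`, the noise level, the bandwidth, and all function names are assumptions chosen purely for illustration.

```python
import numpy as np


def nadaraya_watson(x_train, y_train, x_query, bandwidth):
    """Nadaraya-Watson estimate with a box kernel K(u) = 1{|u| <= 1}."""
    x_train = np.asarray(x_train, dtype=float)
    y_train = np.asarray(y_train, dtype=float)
    x_query = np.atleast_1d(np.asarray(x_query, dtype=float))
    # weights[i, j] = 1 if training point j lies within `bandwidth` of query i
    dist = np.abs(x_query[:, None] - x_train[None, :])
    weights = (dist <= bandwidth).astype(float)
    totals = weights.sum(axis=1)
    sums = weights @ y_train
    # Convention (an assumption): predict 0 where the window is empty
    return np.divide(sums, totals, out=np.zeros_like(sums), where=totals > 0)


def ar1_chain(n, phi, mean, rng, noise_std=1.0):
    """Simulate X_t = mean + phi * (X_{t-1} - mean) + noise (hypothetical design)."""
    x = np.empty(n)
    x[0] = mean
    for t in range(1, n):
        x[t] = mean + phi * (x[t - 1] - mean) + noise_std * rng.normal()
    return x


def demo(bandwidth=0.2, seed=0):
    """Empirical target risk of an estimator trained on the source chain."""
    rng = np.random.default_rng(seed)
    regression_fn = np.sin  # assumed smooth regression function
    # Source and target chains differ only in the mean of their
    # invariant distribution, which produces the covariate shift.
    x_source = ar1_chain(2000, phi=0.5, mean=0.0, rng=rng)
    x_target = ar1_chain(500, phi=0.5, mean=1.0, rng=rng)
    y_source = regression_fn(x_source) + 0.1 * rng.normal(size=x_source.size)
    pred = nadaraya_watson(x_source, y_source, x_target, bandwidth)
    return float(np.mean((pred - regression_fn(x_target)) ** 2))


if __name__ == "__main__":
    print("empirical target risk:", demo())
```

Target points that fall where the source chain rarely visits have few or no training points in their window, so the local average there is noisy or falls back to the default. The abstract describes the similarity measure as quantifying this kind of mismatch as a function of the bandwidth.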

research
02/06/2022

A new similarity measure for covariate shift with applications to nonparametric regression

We study covariate shift in the context of nonparametric regression. We ...
research
04/11/2022

Local convergence rates of the least squares estimator with applications to transfer learning

Convergence properties of empirical risk minimizers can be conveniently ...
research
05/06/2022

Optimally tackling covariate shift in RKHS-based nonparametric regression

We study the covariate shift problem in the context of nonparametric reg...
research
06/06/2018

Rademacher complexity for Markov chains : Applications to kernel smoothing and Metropolis-Hasting

Following the seminal approach by Talagrand, the concept of Rademacher c...
research
12/15/2020

Minimax Risk and Uniform Convergence Rates for Nonparametric Dyadic Regression

Let i=1,…,N index a simple random sample of units drawn from some large ...
research
11/30/2019

Mix and Match: Markov Chains Mixing Times for Matching in Rideshare

Rideshare platforms such as Uber and Lyft dynamically dispatch drivers t...
research
05/07/2023

Classification Tree Pruning Under Covariate Shift

We consider the problem of pruning a classification tree, that is, selec...