The Power and Limitation of Pretraining-Finetuning for Linear Regression under Covariate Shift

08/03/2022
by   Jingfeng Wu, et al.
4

We study linear regression under covariate shift, where the marginal distribution over the input covariates differs in the source and the target domains, while the conditional distribution of the output given the input covariates is similar across the two domains. We investigate a transfer learning approach with pretraining on the source data and finetuning based on the target data (both conducted by online SGD) for this problem. We establish sharp instance-dependent excess risk upper and lower bounds for this approach. Our bounds suggest that for a large class of linear regression instances, transfer learning with O(N^2) source data (and scarce or no target data) is as effective as supervised learning with N target data. In addition, we show that finetuning, even with only a small amount of target data, could drastically reduce the amount of source data required by pretraining. Our theory sheds light on the effectiveness and limitation of pretraining as well as the benefits of finetuning for tackling covariate shift problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/06/2022

Class Prior Estimation under Covariate Shift – no Problem?

We show that in the context of classification the property of source and...
research
02/06/2022

A new similarity measure for covariate shift with applications to nonparametric regression

We study covariate shift in the context of nonparametric regression. We ...
research
03/05/2018

Marginal Singularity, and the Benefits of Labels in Covariate-Shift

We present new minimax results that concisely capture the relative benef...
research
07/01/2023

Unified Transfer Learning Models for High-Dimensional Linear Regression

Transfer learning plays a key role in modern data analysis when: (1) the...
research
02/23/2022

A Class of Geometric Structures in Transfer Learning: Minimax Bounds and Optimality

We study the problem of transfer learning, observing that previous effor...
research
02/10/2022

Transfer-Learning Across Datasets with Different Input Dimensions: An Algorithm and Analysis for the Linear Regression Case

With the development of new sensors and monitoring devices, more sources...
research
12/28/2020

Learning by Ignoring

Learning by ignoring, which identifies less important things and exclude...

Please sign up or login with your details

Forgot password? Click here to reset