Stable Prediction with Model Misspecification and Agnostic Distribution Shift

01/31/2020
by Kun Kuang, et al.

For many machine learning algorithms, two main assumptions are required to guarantee performance. One is that the test data are drawn from the same distribution as the training data, and the other is that the model is correctly specified. In real applications, however, we often have little prior knowledge of the test data or of the underlying true model. Under model misspecification, agnostic distribution shift between training and test data leads to inaccurate parameter estimation and unstable prediction across unknown test data. To address these problems, we propose a novel Decorrelated Weighting Regression (DWR) algorithm, which jointly optimizes a variable decorrelation regularizer and a weighted regression model. The variable decorrelation regularizer estimates a weight for each sample such that the variables are decorrelated on the weighted training data. These weights are then used in the weighted regression to improve the accuracy of the estimated effect of each variable, which in turn helps improve the stability of prediction across unknown test data. Extensive experiments demonstrate that our DWR algorithm can significantly improve the accuracy of parameter estimation and the stability of prediction under model misspecification and agnostic distribution shift.
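
The two ingredients named in the abstract, sample weights that decorrelate the variables and a regression fitted with those weights, can be illustrated with a small sketch. This is not the authors' implementation: the abstract describes a joint optimization, whereas the sketch below runs two separate stages for simplicity. The function names, the softmax parameterization of the weights, the backtracking gradient steps, and the penalty strength `lam` (added here so the weights do not collapse onto a few samples) are all assumptions made for illustration.

```python
import numpy as np


def softmax(theta):
    """Map unconstrained parameters to probabilities that sum to 1."""
    z = np.exp(theta - theta.max())
    return z / z.sum()


def offdiag_cov(X, p):
    """Weighted covariance of the columns of X with its diagonal zeroed."""
    Xc = X - p @ X                      # center with the weighted mean
    C = (Xc * p[:, None]).T @ Xc
    return C - np.diag(np.diag(C)), Xc


def decorrelation_loss(X, p):
    """Sum of squared off-diagonal entries of the weighted covariance."""
    off, _ = offdiag_cov(X, p)
    return float(np.sum(off ** 2))


def objective(X, p, lam):
    """Decorrelation loss plus a penalty that discourages degenerate weights."""
    return decorrelation_loss(X, p) + lam * len(p) * float(np.sum(p ** 2))


def learn_weights(X, lam=0.1, n_iter=300):
    """Stage 1 (illustrative): gradient descent with backtracking on the weights."""
    n = X.shape[0]
    theta = np.zeros(n)                 # start from uniform weights
    step = float(n)
    best = objective(X, softmax(theta), lam)
    for _ in range(n_iter):
        p = softmax(theta)
        off, Xc = offdiag_cov(X, p)
        # Gradient w.r.t. p_i, up to a term constant in i that cancels below.
        g = 2.0 * np.sum((Xc @ off) * Xc, axis=1) + 2.0 * lam * n * p
        grad = p * (g - p @ g)          # chain rule through the softmax
        cand = theta - step * grad
        cand_val = objective(X, softmax(cand), lam)
        if cand_val < best:             # accept only steps that improve
            theta, best, step = cand, cand_val, step * 1.2
        else:
            step *= 0.5
    return n * softmax(theta)           # non-negative weights summing to n


def weighted_least_squares(X, y, w):
    """Stage 2: least squares with an intercept, each sample weighted by w."""
    Xb = np.column_stack([np.ones(len(X)), X])
    s = np.sqrt(w)
    beta, *_ = np.linalg.lstsq(Xb * s[:, None], y * s, rcond=None)
    return beta


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    z = rng.normal(size=300)
    # Two strongly correlated variables and one independent variable.
    X = np.column_stack([z + 0.5 * rng.normal(size=300),
                         z + 0.5 * rng.normal(size=300),
                         rng.normal(size=300)])
    y = 2.0 * X[:, 0] + 0.1 * rng.normal(size=300)
    w = learn_weights(X)
    print("loss before:", decorrelation_loss(X, np.full(300, 1 / 300)))
    print("loss after: ", decorrelation_loss(X, w / w.sum()))
    print("coefficients:", weighted_least_squares(X, y, w))
```

Because only improving steps are accepted and the penalty is smallest at uniform weights, the decorrelation loss on the reweighted data cannot end up higher than on the unweighted data. The trade-off controlled by `lam` is between stronger decorrelation and a larger effective sample size.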


research
06/08/2020

Balance-Subsampled Stable Prediction

In machine learning, it is commonly assumed that training and test data ...
research
07/17/2021

Minimising quantifier variance under prior probability shift

For the binary prevalence quantification problem under prior probability...
research
11/02/2019

Model Specification Test with Unlabeled Data: Approach from Covariate Shift

We propose a novel framework of the model specification test in regressi...
research
05/31/2023

A Bayesian Perspective On Training Data Attribution

Training data attribution (TDA) techniques find influential training dat...
research
02/27/2020

To be or not to be? A spatial predictive crime model for Rochester

This project uses a spatial model (Geographically Weighted Regression) t...
research
02/08/2022

Conformal prediction for the design problem

In many real-world deployments of machine learning, we use a prediction ...
research
07/30/2020

Stable Learning via Causality-based Feature Rectification

How to learn a stable model under agnostic distribution shift between tr...
