Rethinking Importance Weighting for Deep Learning under Distribution Shift

06/08/2020
by Tongtong Fang, et al.

Under distribution shift (DS), where the training data distribution differs from the test one, importance weighting (IW) is a powerful technique that handles DS in two separate steps: weight estimation (WE) estimates the test-over-training density ratio, and weighted classification (WC) trains the classifier from the weighted training data. However, IW does not work well on complex data, since WE is incompatible with deep learning. In this paper, we rethink IW and theoretically show that it suffers from a circular dependency: we need not only WE for WC, but also WC for WE, where a trained deep classifier is used as the feature extractor (FE). To cut off the dependency, we try to pretrain the FE from unweighted training data, which leads to a biased FE. To overcome the bias, we propose an end-to-end solution, dynamic IW, that iterates between WE and WC and combines them in a seamless manner, so that our WE can also benefit indirectly from deep networks and stochastic optimizers. Experiments with two representative DSs on Fashion-MNIST and CIFAR-10/100 demonstrate that dynamic IW compares favorably with state-of-the-art methods.
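
The two-step IW recipe described in the abstract (WE, then WC) can be illustrated with a minimal sketch. This is not the paper's dynamic IW method: it assumes a toy one-dimensional covariate shift between two Gaussians whose density ratio is known in closed form, so the WE step is computed analytically rather than estimated. The helper names (`gaussian_pdf`, `density_ratio`, `weighted_risk`) and the toy loss are hypothetical and chosen only for illustration.

```python
import numpy as np


def gaussian_pdf(x, mean, std):
    """Density of a univariate normal distribution evaluated at x."""
    return np.exp(-0.5 * ((x - mean) / std) ** 2) / (std * np.sqrt(2.0 * np.pi))


def density_ratio(x, train_mean, test_mean, std):
    """WE step (assumed known here): test-over-training density ratio w(x)."""
    return gaussian_pdf(x, test_mean, std) / gaussian_pdf(x, train_mean, std)


def weighted_risk(losses, weights):
    """WC objective: importance-weighted empirical risk on training points."""
    return float(np.mean(weights * losses))


# Toy setup (an assumption for illustration): train ~ N(0, 1), test ~ N(0.5, 1).
rng = np.random.default_rng(0)
x_train = rng.normal(loc=0.0, scale=1.0, size=200_000)

# Step 1 (WE): weights for each training point.
weights = density_ratio(x_train, train_mean=0.0, test_mean=0.5, std=1.0)

# Step 2 (WC): reweight a per-example loss; here a fixed toy loss (x - 1)^2
# stands in for the loss of a classifier being trained.
losses = (x_train - 1.0) ** 2
unweighted = float(np.mean(losses))        # estimates the training-distribution risk
weighted = weighted_risk(losses, weights)  # estimates the test-distribution risk

print(f"unweighted risk: {unweighted:.3f}")
print(f"importance-weighted risk: {weighted:.3f}")
```

For this toy setup the expected loss is 2 under the training distribution and 1.25 under the test distribution, so the unweighted average should land near the former and the weighted average near the latter. In the setting the abstract targets, the density ratio is unknown and has to be estimated, which is where the circular dependency between WE and WC arises.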


research
11/01/2018

Bias Reduction via End-to-End Shift Learning: Application to Citizen Science

Citizen science projects are successful at gathering rich datasets for v...
research
06/23/2019

Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting

A learned generative model often produces biased statistics relative to ...
research
12/19/2021

Rethinking Importance Weighting for Transfer Learning

A key assumption in supervised learning is that training and test data f...
research
05/24/2023

Generalizing Importance Weighting to A Universal Solver for Distribution Shift Problems

Distribution shift (DS) may have two levels: the distribution itself cha...
research
03/28/2021

Understanding the role of importance weighting for deep learning

The recent paper by Byrd & Lipton (2019), based on empirical observati...
research
06/16/2018

Stable Prediction across Unknown Environments

In many important machine learning applications, the training distributi...
research
06/07/2018

Importance weighted generative networks

Deep generative networks can simulate from a complex target distribution...
