Importance Sampling with Unequal Support

11/10/2016
by Philip S. Thomas, et al.

Importance sampling is often used in machine learning when training and testing data come from different distributions. In this paper we propose a new variant of importance sampling that can reduce the variance of importance sampling-based estimates by orders of magnitude when the supports of the training and testing distributions differ. After motivating and presenting our new importance sampling estimator, we provide a detailed theoretical analysis that characterizes its bias and variance relative to the ordinary importance sampling estimator in various settings, including cases where ordinary importance sampling is biased while our new estimator is not, and vice versa. We conclude with an example of how our new estimator can improve estimates of how well a new diabetes treatment policy will work for an individual, using only data collected while the individual followed a previous treatment policy.
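
To make the setting concrete, the following is a minimal sketch of ordinary importance sampling (not the new estimator proposed in the paper) when the target distribution's support is a strict subset of the sampling distribution's support. The distributions and every name here (`target_pdf`, `sampling_pdf`, `h`, `ordinary_is`) are hypothetical choices made for illustration: samples are drawn uniformly from [0, 1), while the target is uniform on [0, 0.1), so the true value of the expectation of h(x) = x under the target is 0.05.

```python
import random


def target_pdf(x):
    # Hypothetical target density: uniform on [0, 0.1).
    return 10.0 if 0.0 <= x < 0.1 else 0.0


def sampling_pdf(x):
    # Hypothetical sampling density: uniform on [0, 1).
    return 1.0 if 0.0 <= x < 1.0 else 0.0


def h(x):
    # Quantity whose expectation under the target we want to estimate.
    return x


def ordinary_is(samples):
    # Ordinary importance sampling: average of weight * h over ALL samples,
    # where weight = target density / sampling density.
    weighted = [target_pdf(x) / sampling_pdf(x) * h(x) for x in samples]
    return sum(weighted) / len(weighted)


rng = random.Random(0)  # fixed seed so the run is reproducible
samples = [rng.random() for _ in range(100_000)]

estimate = ordinary_is(samples)

# Fraction of samples that fall outside the target's support and therefore
# receive an importance weight of exactly zero.
zero_fraction = sum(1 for x in samples if target_pdf(x) == 0.0) / len(samples)

print(f"ordinary IS estimate: {estimate:.4f} (true value 0.05)")
print(f"fraction of zero-weight samples: {zero_fraction:.3f}")
```

In this toy setup roughly nine out of ten samples contribute nothing but zeros to the average, which is the kind of support mismatch the abstract refers to when it describes variance that can be reduced.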

