On the Validity of Conformal Prediction for Network Data Under Non-Uniform Sampling

06/12/2023
by Robert Lunde, et al.

We study the properties of conformal prediction for network data under various sampling mechanisms that commonly arise in practice but often result in a non-representative sample of nodes. We interpret these sampling mechanisms as selection rules applied to a superpopulation and study the validity of conformal prediction conditional on an appropriate selection event. We show that the sampled subarray is exchangeable conditional on the selection event if the selection rule satisfies a permutation invariance property and a joint exchangeability condition holds for the superpopulation. Our result implies the finite-sample validity of conformal prediction for certain selection events related to ego networks and snowball sampling. We also show that when data are sampled via a random walk on a graph, a variant of weighted conformal prediction yields asymptotically valid prediction sets for an independently selected node from the population.
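To make the last claim concrete, here is a minimal sketch of a weighted split-conformal quantile in Python. It is not the authors' procedure: the abstract does not specify their variant, so this follows the generic weighted conformal recipe and assumes a simple random walk on a connected undirected graph, whose stationary distribution is proportional to node degree. Under that assumption, a node visited by the walk is reweighted by the inverse of its degree to target a uniformly selected node. The function names, the absolute-residual score, and the inverse-degree weights are all illustrative assumptions.

```python
import numpy as np


def weighted_conformal_quantile(scores, weights, test_weight, alpha):
    """Weighted (1 - alpha) quantile of calibration scores.

    The test point contributes a point mass at +infinity with weight
    `test_weight`, as in the generic weighted conformal construction.
    """
    scores = np.asarray(scores, dtype=float)
    weights = np.asarray(weights, dtype=float)

    # Sort scores and carry the weights along.
    order = np.argsort(scores)
    sorted_scores = scores[order]
    sorted_weights = weights[order]

    # Normalize so calibration weights plus the test weight sum to one.
    total = sorted_weights.sum() + test_weight
    cumulative = np.cumsum(sorted_weights) / total

    # Smallest score whose cumulative weight reaches 1 - alpha.
    idx = np.searchsorted(cumulative, 1.0 - alpha, side="left")
    if idx >= len(sorted_scores):
        return np.inf  # the required mass sits on the test point
    return sorted_scores[idx]


def inverse_degree_weights(degrees):
    """Hypothetical weights for random-walk samples: proportional to 1/degree.

    Assumes the walk is near its stationary distribution, which is
    proportional to degree on a connected undirected graph.
    """
    degrees = np.asarray(degrees, dtype=float)
    return 1.0 / degrees


def prediction_interval(prediction, quantile):
    """Symmetric interval for absolute-residual conformity scores."""
    return prediction - quantile, prediction + quantile


# Illustrative calibration data (made up for the sketch).
calibration_scores = [0.4, 1.3, 0.7, 2.1]  # |y_i - prediction_i| on sampled nodes
calibration_degrees = [2, 8, 4, 1]         # degrees of the sampled nodes
test_degree = 4                            # degree of the new node

q = weighted_conformal_quantile(
    calibration_scores,
    inverse_degree_weights(calibration_degrees),
    test_weight=1.0 / test_degree,
    alpha=0.2,
)
lower, upper = prediction_interval(prediction=3.0, quantile=q)
```

With equal weights the quantile reduces to the usual split-conformal rule, which is a convenient sanity check. High-degree nodes are oversampled by the walk, so they receive smaller weights in the calibration set.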
