Core-fringe link prediction

11/28/2018
by Austin R. Benson, et al.

Data collection often involves the partial measurement of a larger system. A common example arises in the process of collecting network data: we often obtain network datasets by recording all of the interactions among a small set of core nodes, so that we end up with a measurement of the network consisting of these core nodes together with a potentially much larger set of fringe nodes that have links to the core. Given the ubiquity of this process for assembling network data, it becomes crucial to understand the role of such a core-fringe structure. Here we study how the inclusion of fringe nodes affects the standard task of network link prediction. One might initially think that any additional data is useful, and hence that it should be beneficial to include all available fringe nodes. However, we find that this is not true; in fact, there is substantial variability in the value of the fringe nodes for prediction. In some datasets, once an algorithm is selected, including any additional data from the fringe can actually hurt prediction performance; in other datasets, including some amount of fringe information is useful before prediction performance saturates or even declines; and in further cases, including the entire fringe leads to the best performance. While such variety might seem surprising, we show that these behaviors are exhibited by simple random graph models.
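To make the core-fringe setup concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation: the toy edge list, the node names, and the helper functions `core_fringe_edges`, `adjacency`, and `common_neighbor_scores` are all hypothetical, and the common-neighbors score is used only as one standard link-prediction heuristic. The sketch keeps only the edges that touch the core (as the measurement process above would) and then scores unlinked core pairs while including different subsets of the fringe.

```python
from itertools import combinations


def core_fringe_edges(edges, core):
    """Keep only edges with at least one endpoint in the core.

    This mimics the measurement process: fringe-fringe edges are never observed.
    """
    core = set(core)
    return [(u, v) for u, v in edges if u in core or v in core]


def adjacency(edges, allowed):
    """Build adjacency sets restricted to the nodes in `allowed`."""
    adj = {n: set() for n in allowed}
    for u, v in edges:
        if u in allowed and v in allowed:
            adj[u].add(v)
            adj[v].add(u)
    return adj


def common_neighbor_scores(edges, core, fringe_included):
    """Score each unlinked core pair by its number of common neighbors,
    using the core plus only the fringe nodes in `fringe_included`."""
    core = set(core)
    adj = adjacency(edges, core | set(fringe_included))
    scores = {}
    for u, v in combinations(sorted(core), 2):
        if v not in adj[u]:
            scores[(u, v)] = len(adj[u] & adj[v])
    return scores


# Hypothetical toy network: a, b, c form the core; x and y are fringe nodes.
edges = [("a", "b"), ("b", "c"), ("a", "x"), ("c", "x"),
         ("a", "y"), ("c", "y"), ("x", "y")]
core = {"a", "b", "c"}

observed = core_fringe_edges(edges, core)  # the fringe-fringe edge (x, y) is dropped

no_fringe = common_neighbor_scores(observed, core, fringe_included=[])
some_fringe = common_neighbor_scores(observed, core, fringe_included=["x"])
all_fringe = common_neighbor_scores(observed, core, fringe_included=["x", "y"])
```

In this toy case the score for the candidate pair `("a", "c")` grows as fringe nodes are added. Whether that extra signal helps or hurts ranking accuracy on real data is exactly the question the abstract raises, and the sketch does not answer it.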

research
02/20/2018

Simplicial Closure and Higher-order Link Prediction

Networks provide a powerful formalism for modeling complex systems, by r...
research
05/14/2019

Planted Hitting Set Recovery in Hypergraphs

In various application areas, networked data is collected by measuring i...
research
05/03/2018

Found Graph Data and Planted Vertex Covers

A typical way in which network data is recorded is to measure all the in...
research
11/04/2022

Rethinking the positive role of cluster structure in complex networks for link prediction tasks

Clustering is a fundamental problem in network analysis that finds close...
research
07/23/2020

On a Bernoulli Autoregression Framework for Link Discovery and Prediction

We present a dynamic prediction framework for binary sequences that is b...
research
08/01/2022

Event Notifications in Value-Adding Networks

Linkages between research outputs are crucial in the scholarly knowledge...
research
03/08/2021

Threshold-based Network Structural Dynamics

The interest in dynamic processes on networks is steadily rising in rece...
