Stacking Models for Nearly Optimal Link Prediction in Complex Networks

09/17/2019
by Amir Ghasemian, et al.

Most real-world networks are incompletely observed. Algorithms that can accurately predict which links are missing can dramatically speed up the collection of network data and improve the validity of network models. Many algorithms now exist for predicting missing links, given a partially observed network, but it has remained unknown whether a single best predictor exists, how link predictability varies across methods and networks from different domains, and how close to optimality current methods are. We answer these questions by systematically evaluating 203 individual link prediction algorithms, representing three popular families of methods, applied to a large corpus of 548 structurally diverse networks from six scientific domains. We first show that individual algorithms exhibit a broad diversity of prediction errors, such that no one predictor or family is best, or worst, across all realistic inputs. We then exploit this diversity via meta-learning to construct a series of "stacked" models that combine predictors into a single algorithm. Applied to a broad range of synthetic networks, for which we may analytically calculate optimal performance, these stacked models achieve optimal or nearly optimal levels of accuracy. Applied to real-world networks, stacked models are also superior, but their accuracy varies strongly by domain, suggesting that link prediction may be fundamentally easier in social networks than in biological or technological networks. These results indicate that the state of the art for link prediction comes from combining individual algorithms, which achieves nearly optimal predictions. We close with a brief discussion of limitations and opportunities for further improvement of these results.
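The stacking idea described above can be illustrated with a small, self-contained sketch. It is not the authors' implementation: the abstract does not name the individual predictors or the meta-learner, so the three topological scores (common neighbors, Jaccard, log preferential attachment), the logistic-regression meta-learner, the toy two-community graph, and every function name below are assumptions chosen for brevity. Each predictor's score becomes one feature, and a supervised model trained on edges hidden from the observed network learns how to combine them.

```python
import math
import random
from itertools import combinations

NAMES = ["common_neighbors", "jaccard", "log_pref_attachment"]

def make_graph(n_per_group=12, p_in=0.6, p_out=0.05, seed=0):
    """Toy two-community graph as an adjacency dict (hypothetical test data)."""
    rng = random.Random(seed)
    n = 2 * n_per_group
    adj = {u: set() for u in range(n)}
    for u, v in combinations(range(n), 2):
        same = (u < n_per_group) == (v < n_per_group)
        if rng.random() < (p_in if same else p_out):
            adj[u].add(v)
            adj[v].add(u)
    return adj

def features(adj, u, v):
    """Scores from three simple predictors, used as inputs to the stacked model."""
    nu, nv = adj[u], adj[v]
    common, union = len(nu & nv), len(nu | nv)
    jaccard = common / union if union else 0.0
    return [common, jaccard, math.log1p(len(nu) * len(nv))]

def hold_out(adj, frac, rng):
    """Remove a random fraction of edges; return (reduced graph, removed edges)."""
    edges = sorted((u, v) for u in adj for v in adj[u] if u < v)
    rng.shuffle(edges)
    removed = edges[:max(1, int(frac * len(edges)))]
    reduced = {u: set(nb) for u, nb in adj.items()}
    for u, v in removed:
        reduced[u].discard(v)
        reduced[v].discard(u)
    return reduced, removed

def non_edges(adj):
    return [(u, v) for u, v in combinations(sorted(adj), 2) if v not in adj[u]]

def linear_score(w, x):
    """Weighted sum of predictor scores plus bias (last weight)."""
    return w[-1] + sum(wj * xj for wj, xj in zip(w, x))

def train_logistic(X, y, epochs=300, lr=0.05):
    """Meta-learner (an assumption): logistic regression via batch gradient descent."""
    w = [0.0] * (len(X[0]) + 1)
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for xi, yi in zip(X, y):
            z = max(-30.0, min(30.0, linear_score(w, xi)))  # clamp to avoid overflow
            err = 1.0 / (1.0 + math.exp(-z)) - yi
            for j, xj in enumerate(xi):
                grad[j] += err * xj
            grad[-1] += err
        w = [wj - lr * g / len(X) for wj, g in zip(w, grad)]
    return w

def auc(scores, labels):
    """Probability that a random positive outscores a random negative (ties count 0.5)."""
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def run_demo(seed=0):
    rng = random.Random(seed)
    true_adj = make_graph(seed=seed)
    # Hide 20% of true edges: these are the missing links to be predicted.
    observed, test_pos = hold_out(true_adj, 0.2, rng)
    test_neg = non_edges(true_adj)
    # Training labels come only from the observed graph: hide edges again.
    train_adj, train_pos = hold_out(observed, 0.2, rng)
    candidates = non_edges(observed)
    train_neg = rng.sample(candidates, min(len(train_pos), len(candidates)))
    X = [features(train_adj, u, v) for u, v in train_pos + train_neg]
    y = [1] * len(train_pos) + [0] * len(train_neg)
    w = train_logistic(X, y)
    # Evaluate each individual predictor and the stacked combination.
    labels = [1] * len(test_pos) + [0] * len(test_neg)
    feats = [features(observed, u, v) for u, v in test_pos + test_neg]
    result = {name: auc([f[i] for f in feats], labels) for i, name in enumerate(NAMES)}
    result["stacked"] = auc([linear_score(w, f) for f in feats], labels)
    return result

if __name__ == "__main__":
    for name, value in run_demo().items():
        print(f"{name:>20s}  AUC = {value:.3f}")
```

Running the script prints one AUC per individual predictor and one for the stacked combination. On a toy graph this small the gap between them is not meaningful; the point is only the workflow of turning predictor outputs into features for a meta-learner. A stronger nonlinear learner could replace `train_logistic` without changing the rest.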

