Fast Computation of Katz Index for Efficient Processing of Link Prediction Queries

12/13/2019
by   Mustafa Coskun, et al.
0

Network proximity computations are among the most common operations in various data mining applications, including link prediction and collaborative filtering. A common measure of network proximity is Katz index, which has been shown to be among the best-performing path-based link prediction algorithms. With the emergence of very large network databases, such proximity computations become an important part of query processing in these databases. Consequently, significant effort has been devoted to developing algorithms for efficient computation of Katz index between a given pair of nodes or between a query node and every other node in the network. Here, we present LRC-Katz, an algorithm based on indexing and low-rank correction to accelerate Katz index-based network proximity queries. Using a variety of very large real-world networks, we show that LRC-Katz outperforms the fastest existing method, Conjugate Gradient, for a wide range of parameter values. We also show that this acceleration in the computation of Katz index can be used to drastically improve the efficiency of processing link prediction queries in very large networks. Motivated by this observation, we propose a new link prediction algorithm that exploits modularity of networks that are encountered in practical applications. Our experimental results on the link prediction problem show that our modularity based algorithm significantly outperforms the state-of-the-art link prediction Katz method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/04/2020

ALPINE: Active Link Prediction using Network Embedding

Many real-world problems can be formalized as predicting links in a part...
research
03/14/2021

Collaborative Filtering Approach to Link Prediction

Link prediction is a fundamental challenge in network science. Among var...
research
08/13/2019

Growth of Common Friends in a Preferential Attachment Model

The number of common friends (or connections) in a graph is a commonly u...
research
05/07/2019

PRSim: Sublinear Time SimRank Computation on Large Power-Law Graphs

SimRank is a classic measure of the similarities of nodes in a graph. G...
research
05/27/2023

DotHash: Estimating Set Similarity Metrics for Link Prediction and Document Deduplication

Metrics for set similarity are a core aspect of several data mining task...
research
12/22/2019

Link prediction in dynamic networks using random dot product graphs

The problem of predicting links in large networks is a crucial task in a...
research
09/01/2018

Improving Visual Relationship Detection using Semantic Modeling of Scene Descriptions

Structured scene descriptions of images are useful for the automatic pro...

Please sign up or login with your details

Forgot password? Click here to reset