On rank statistics of PageRank and MarkovRank

08/05/2021
by   Yoichi Nishiyama, et al.
0

An important statistic in analyzing some (finite) network data, called PageRank, and a related new statistic, which we call MarkovRank, are studied in this paper. The PageRank was originally developed by the cofounders of Google, Sergey Brin and Larry Page, to optimize the ranking of websites for their search engine outcomes, and it is computed using an iterative algorithm, based on the idea that nodes with a larger number of incoming edges are more important. The aim of this paper is to analyze the common features and some significant differences between the PageRank and the new Rank. A common merit of the two Ranks is that both statistics can be easily computed by either the mathematical computation or the iterative algorithm. According to the analysis of some examples, these two statistics seem to return somewhat different values, but the resulting rank statistics of both statistics are not far away from each other. One of the differences is that only MarkovRank has the property that its rank statistic does not depend on any tuning parameter, and it is determined only through the adjacency matrix for given network data. Moreover, it is also shown that the rank statistic of MarkovRank is identical to or "finer than" that of the stationary distribution vector of the corresponding Markov chain (with finite state space) whenever the Markov chain is regular. Thus, MarkovRank may have a potential to play a similar role to the well established PageRank from a practical point of view, not only commonly with light computational tasks, but also with some new theoretical validity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/13/2022

Markov Chain Monte Carlo for generating ranked textual data

This paper faces a central theme in applied statistics and information s...
research
01/19/2018

Distribution-free runs-based control charts

We propose distribution-free runs-based control charts for detecting loc...
research
07/29/2023

Shared Information for a Markov Chain on a Tree

Shared information is a measure of mutual dependence among multiple join...
research
12/21/2018

Revisiting the Gelman-Rubin Diagnostic

Gelman and Rubin's (1992) convergence diagnostic is one of the most popu...
research
07/11/2022

Testing Independence of Bivariate Censored Data using Random Walk on Restricted Permutation Graph

In this paper, we propose a procedure to test the independence of bivari...
research
01/22/2022

Estimation of the covariance structure from SNP allele frequencies

We propose two new statistics, V and S, to disentangle the population hi...

Please sign up or login with your details

Forgot password? Click here to reset