PageRank Centrality in Directed Graphs with Bounded In-Degree
Abstract
We study the computational complexity of locally estimating a node's PageRank centrality in a directed graph G. For any node t, its PageRank centrality π(t) is defined as the probability that a random walk in G, starting from a uniformly chosen node, terminates at t, where each step terminates with a constant probability α∈(0,1). To obtain a multiplicative (1 O(1))-approximation of π(t) with probability (1), the previously best upper bound is O(n1/2\ in1/2,out1/2,m1/4\) from [Wang, Wei, Wen, Yang, STOC '24], where n and m denote the number of nodes and edges in G, and in and out upper bound the in-degrees and out-degrees of G, respectively. Using a refinement of the proof in the same paper, we establish a lower bound of (n1/2\in1/2/nγ,out1/2/nγ,m1/4\), where γ=12(2\1/(1-α)in,1\-1)-1. As γ only depends on in and nγ=O(1) for in=(n(1)), the known upper bound is tight if we only parameterize the complexity by n, m, and out. However, there remains a gap of (nγ) when considering in, and this gap is large when in is small. In the extreme case where in1/(1-α), we have γ=1/2, leading to a gap of (n1/2) between the bounds O(n1/2) and (1). In this paper, we present a new algorithm that achieves the above lower bound (up to logarithmic factors). The algorithm assumes that n and the bounds in and out are known in advance. Our key technique is a novel randomized backwards propagation process that only propagates selectively based on Monte Carlo estimated PageRank scores.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.