Approximating Single-Source Personalized PageRank with Absolute Error Guarantees

Abstract

Personalized PageRank (PPR) is an extensively studied and applied node proximity measure in graphs. For a pair of nodes s and t on a graph G=(V,E), the PPR value π(s,t) is defined as the probability that an α-discounted random walk from s terminates at t, where the walk terminates with probability α at each step. We study the classic Single-Source PPR query, which asks for PPR approximations from a given source node s to all nodes in the graph. Specifically, we aim to provide approximations with absolute error guarantees, ensuring that the resultant PPR estimates π(s,t) satisfy t∈ V|π(s,t)-π(s,t)| for a given error bound . We propose an algorithm that achieves this with high probability, with an expected running time of - O(m/) for directed graphs, where m=|E|; - O(dmax/) for undirected graphs, where dmax is the maximum node degree in the graph; - O(nγ-1/2/) for power-law graphs, where n=|V| and γ∈(12,1) is the extent of the power law. These sublinear bounds improve upon existing results. We also study the case when degree-normalized absolute error guarantees are desired, requiring t∈ V|π(s,t)/d(t)-π(s,t)/d(t)|d for a given error bound d, where the graph is undirected and d(t) is the degree of node t. We give an algorithm that provides this error guarantee with high probability, achieving an expected complexity of O(Σt∈ Vπ(s,t)/d(t)/d). This improves over the previously known O(1/d) complexity.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…