Near-Optimality for Single-Source Personalized PageRank
Abstract
The Single-Source Personalized PageRank (SSPPR) query is central to graph OLAP, measuring the probability π(s,t) that an α-decay random walk from node s terminates at t. Despite decades of research, a significant gap remains between upper and lower bounds for its computational complexity. Existing upper bounds are O(((1/ε)ε2, m nε, m 1ε)) for SSPPR-A and O(((1/n)δ, m (n/δ), m ((n)mδ))) for SSPPR-R, with trivial lower bounds of ((n,1/ε)) and ((n,1/δ)). This work narrows or closes this gap. We improve the upper bounds for SSPPR-A and SSPPR-R to O(1ε2) and O(((1/δ)δ, m + n (n) ((n)mδ))), respectively, offering improvements by factors of (1/ε) and ((n)mδ). On the lower bound side, we establish stronger results: ((m, 1/ε2)) for SSPPR-A and ((m, (1/δ)δ)) for SSPPR-R, strengthening theoretical foundations. Our upper and lower bounds for SSPPR-R coincide for graphs with m ∈ (n 2 n) and any threshold δ, 1/δ ∈ O(poly(n)), achieving theoretical optimality in most graph regimes. The SSPPR-A query attains partial optimality for large error thresholds, matching our new lower bound. This is the first optimal result for SSPPR queries. Our techniques generalize to the Single-Target Personalized PageRank (STPPR) query, improving its lower bound from ((n, 1/δ)) to ((m, nδ n)), matching the upper bound and revealing its optimality.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.