Near-Optimal Two-Pass Streaming Algorithm for Sampling Random Walks over Directed Graphs

Abstract

For a directed graph G with n vertices and a start vertex u start, we wish to (approximately) sample an L-step random walk over G starting from u start with minimum space using an algorithm that only makes few passes over the edges of the graph. This problem found many applications, for instance, in approximating the PageRank of a webpage. If only a single pass is allowed, the space complexity of this problem was shown to be (n · L). Prior to our work, a better space complexity was only known with O(L) passes. We settle the space complexity of this random walk simulation problem for two-pass streaming algorithms, showing that it is (n · L), by giving almost matching upper and lower bounds. Our lower bound argument extends to every constant number of passes p, and shows that any p-pass algorithm for this problem uses (n · L1/p) space. In addition, we show a similar (n · L) bound on the space complexity of any algorithm (with any number of passes) for the related problem of sampling an L-step random walk from every vertex in the graph.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…