Recursive Sketching For Frequency Moments

Abstract

In a ground-breaking paper, Indyk and Woodruff (STOC 05) showed how to compute Fk (for k>2) in space complexity O( poly-log(n,m)· n1-2k), which is optimal up to (large) poly-logarithmic factors in n and m, where m is the length of the stream and n is the upper bound on the number of distinct elements in a stream. The best known lower bound for large moments is ((n)n1-2k). A follow-up work of Bhuvanagiri, Ganguly, Kesh and Saha (SODA 2006) reduced the poly-logarithmic factors of Indyk and Woodruff to O(2(m)· ( n+ m)· n1-2 k). Further reduction of poly-log factors has been an elusive goal since 2006, when Indyk and Woodruff method seemed to hit a natural "barrier." Using our simple recursive sketch, we provide a different yet simple approach to obtain a O((m)(nm)· ( n)4· n1-2 k) algorithm for constant ε (our bound is, in fact, somewhat stronger, where the ( n) term can be replaced by any constant number of iterations instead of just two or three, thus approaching log*n. Our bound also works for non-constant ε (for details see the body of the paper). Further, our algorithm requires only 4-wise independence, in contrast to existing methods that use pseudo-random generators for computing large frequency moments.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…