Separations for Estimating Large Frequency Moments on Data Streams

Abstract

We study the classical problem of moment estimation of an underlying vector whose n coordinates are implicitly defined through a series of updates in a data stream. We show that if the updates to the vector arrive in the random-order insertion-only model, then there exist space efficient algorithms with improved dependencies on the approximation parameter . In particular, for any real p > 2, we first obtain an algorithm for Fp moment estimation using O(14/p· n1-2/p) bits of memory. Our techniques also give algorithms for Fp moment estimation with p>2 on arbitrary order insertion-only and turnstile streams, using O(14/p· n1-2/p) bits of space and two passes, which is the first optimal multi-pass Fp estimation algorithm up to n factors. Finally, we give an improved lower bound of (12· n1-2/p) for one-pass insertion-only streams. Our results separate the complexity of this problem both between random and non-random orders, as well as one-pass and multi-pass streams.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…