A Lower Bound for Estimating High Moments of a Data Stream
Abstract
We show an improved lower bound for the Fp estimation problem in a data stream setting for p>2. A data stream is a sequence of items from the domain [n] with possible repetitions. The frequency vector x is an n-dimensional non-negative integer vector x such that x(i) is the number of occurrences of i in the sequence. Given an accuracy parameter Omega(n-1/p) < ε < 1, the problem of estimating Fp is to estimate xpp = Σi ∈ [n] x(i)p correctly to within a relative accuracy of 1 ε with high constant probability in an online fashion and using as little space as possible. The current space lower bound for this problem is Omega(n1-2/p ε-2/p+ n1-2/pε-4/p/ O(1)(n)+ (ε-2 + (n))). The first term in the lower bound expression was proved in B-YJKS:stoc02,cks:ccc03, the second in wz:arxiv11 and the third in wood:soda04. In this note, we show an Omega(p2 n1-2/p ε-2/ (n)) bits space bound, for Omega(pn-1/p) ε 1/10.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.