Polynomial Estimators for High Frequency Moments

Abstract

We present an algorithm for computing Fp, the pth moment of an n-dimensional frequency vector of a data stream, for 2 < p < (n) , to within 1 ε factors, ε ∈ [n-1/p,1] with high constant probability. Let m be the number of stream records and M be the largest magnitude of a stream update. The algorithm uses space in bits O(p2ε-2n1-2/pE(p,n) (n) (nmM)/( (n),ε4/p-2)) where, E(p,n) = (1-2/p)-1(1-n-4(1-2/p). Here E(p,n) is O(1) for p = 2+(1) and O( n) for p = 2 + O(1/ (n). This improves upon the space required by current algorithms iw:stoc05,bgks:soda06,ako:arxiv10,bo:arxiv10 by a factor of at least (ε-4/p ( (n), ε4/p-2)). The update time is O( (n)). We use a new technique for designing estimators for functions of the form (X), where, X is a random variable and is a smooth function, based on a low-degree Taylor polynomial expansion of (X) around an estimate of X.

0

Discussion (0)

Sign in to join the discussion.

Loading comments…