The Optimal Quantile Estimator for Compressed Counting
Abstract
Compressed Counting (CC) was recently proposed for very efficiently computing the (approximate) αth frequency moments of data streams, where 0<α <= 2. Several estimators were reported including the geometric mean estimator, the harmonic mean estimator, the optimal power estimator, etc. The geometric mean estimator is particularly interesting for theoretical purposes. For example, when α -> 1, the complexity of CC (using the geometric mean estimator) is O(1/ε), breaking the well-known large-deviation bound O(1/ε2). The case α≈ 1 has important applications, for example, computing entropy of data streams. For practical purposes, this study proposes the optimal quantile estimator. Compared with previous estimators, this estimator is computationally more efficient and is also more accurate when α> 1.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.