An Improved Interactive Streaming Algorithm for the Distinct Elements Problem

Abstract

The exact computation of the number of distinct elements (frequency moment F0) is a fundamental problem in the study of data streaming algorithms. We denote the length of the stream by n where each symbol is drawn from a universe of size m. While it is well known that the moments F0,F1,F2 can be approximated by efficient streaming algorithms, it is easy to see that exact computation of F0,F2 requires space (m). In previous work, Cormode et al. therefore considered a model where the data stream is also processed by a powerful helper, who provides an interactive proof of the result. They gave such protocols with a polylogarithmic number of rounds of communication between helper and verifier for all functions in NC. This number of rounds (O(2 m) \;in the case of \;F0 ) can quickly make such protocols impractical. Cormode et al. also gave a protocol with m +1 rounds for the exact computation of F0 where the space complexity is O( m n+2 m) but the total communication O(n m( n+ m )). They managed to give m round protocols with polylog(m,n) complexity for many other interesting problems including F2, Inner product, and Range-sum, but computing F0 exactly with polylogarithmic space and communication and O( m) rounds remained open. In this work, we give a streaming interactive protocol with m rounds for exact computation of F0 using O( m (\, n + m m\,)) bits of space and the communication is O( m (\, n +3 m ( m)2 \,)). The update time of the verifier per symbol received is O(2 m).

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…