Memory Complexity of Estimating Entropy and Mutual Information

Abstract

We observe an infinite sequence of independent identically distributed random variables X1,X2,… drawn from an unknown distribution p over [n], and our goal is to estimate the entropy H(p)=-E[ p(X)] within an -additive error. To that end, at each time point we are allowed to update a finite-state machine with S states, using a possibly randomized but time-invariant rule, where each state of the machine is assigned an entropy estimate. Our goal is to characterize the minimax memory complexity S* of this problem, which is the minimal number of states for which the estimation task is feasible with probability at least 1-δ asymptotically, uniformly in p. Specifically, we show that there exist universal constants C1 and C2 such that S* ≤ C1·n ( n)42δ for not too small, and S* ≥ C2 · \n, n\ for not too large. The upper bound is proved using approximate counting to estimate the logarithm of p, and a finite memory bias estimation machine to estimate the expectation operation. The lower bound is proved via a reduction of entropy estimation to uniformity testing. We also apply these results to derive bounds on the memory complexity of mutual information estimation.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…