Finite sample properties of the mean occupancy counts and probabilities

Abstract

For a probability distribution P on an at most countable alphabet A, this article gives finite sample bounds for the expected occupancy counts E Kn,r and probabilities E Mn,r. Both upper and lower bounds are given in terms of the counting function of P. Special attention is given to the case where is bounded by a regularly varying function. In this case, it is shown that our general results lead to an optimal-rate control of the expected occupancy counts and probabilities with explicit constants. Our results are also put in perspective with Turing's formula and recent concentration bounds to deduce bounds in probability. At the end of the paper, we discuss an extension of the occupancy problem to arbitrary distributions in a metric space.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…