Limit theorems for a class of random outer measures in infinite urn schemes

Abstract

An urn scheme is a probabilistic model in which balls are placed into urns sequentially and independently of each other. All balls share the same probability distribution for hitting the urns. In the simplest case, there is a finite number of urns and the probabilities of hitting each urn are equal. In an infinite urn scheme, there is a countable number of urns, and the hitting probabilities form a probability mass function on the set of urn labels, so they depend on the urn number. The statistics of interest is the number of urns with at least k balls after throwing n balls. Thus, we assume that there is a countable family of urns, and we fix the probabilities for a ball to hit each urn (the same for all balls). For an arbitrary subset A of the unit interval [0, 1], we do not consider all ball indices from 1 to n, but only those that belong to the set nA, and we study the number of urns with at least k balls after throwing the balls with indices in nA. This number is non-negative, and if the set A is empty, this number is equal to zero. Moreover, if k=1, then it satisfies the property of countable subadditivity. Hence, the number of non-empty urns for ball indices in nA, where A is an arbitrary subset of the unit interval, satisfies all the axioms of an outer measure on the unit interval. We study the properties of the statistics of interest. Our main result is a functional central limit theorem for sets A consisting of finite unions of intervals and parameterized by their boundary points. We consider applications of this theorem to elementary probabilistic models of text.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…