Limit Theory of the Multi-set Allocation Occupancy (MAO) Distribution: Normal and Poisson Approximations via MAO Norm
Abstract
This paper investigates the asymptotic behavior of the Multi-set Allocation Occupancy (MAO) distribution, which models the count vector X=(X=0,…,X=T) from T independent rounds of sampling without replacement of size m from N individuals. Focusing on X=t (individuals in exactly t subsets) and employing the MAO norm -- a combinatorial tool yielding closed-form factorial moments -- we derive the exact marginal distribution of a single individual as Bin(T,p) with p=m/N. Using the MAO norm, we prove that for any fixed number of distinct individuals, their joint distribution differs from the product of marginals by O(1/N), establishing the weak dependence required for limit theorems. Based on these findings, we delineate two asymptotic regimes: 1. Normal approximation: When N∞ with p=m/N fixed, X=t obeys a central limit theorem and can be approximated by a normal distribution with mean Nπt and variance obtained from the MAO norm, where πt = Tt pt (1-p)T-t.2. Poisson approximation: When the expected value λN := Nπt 0, X=t converges in distribution to Poisson(λ). By symmetry, if λN N, then N-X=t converges to a Poisson distribution. All theoretical results are rigorously proved via moment methods and are corroborated by extensive numerical simulations, which further demonstrate that the approximations remain valid even when the subset sizes are not equal. The MAO norm thus emerges as a unifying tool that connects the exact combinatorial structure of the model to its asymptotic theory. These findings offer clear guidance for choosing the appropriate approximation in practical applications, and they open the door to using the MAO distribution as a null model for detecting non random aggregation in fields as diverse as ecology, sociology, and statistical physics.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.