A Berry-Esseen theorem for Pitman's α-diversity
Abstract
This paper is concerned with the study of the random variable Kn denoting the number of distinct elements in a random sample (X1, …, Xn) of exchangeable random variables driven by the two parameter Poisson-Dirichlet distribution, PD(α,θ). For α∈(0,1), Theorem 3.8 in Pit(06) shows that Knnαa.s. Sα,θ as n→+∞. Here, Sα,θ is a random variable distributed according to the so-called scaled Mittag-Leffler distribution. Our main result states that x ≥ 0 | [Knnα ≤ x ] - [Sα,θ ≤ x] | ≤ C(α, θ)nα holds with an explicit constant C(α, θ). The key ingredients of the proof are a novel probabilistic representation of Kn as compound distribution and new, refined versions of certain quantitative bounds for the Poisson approximation and the compound Poisson distribution.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.