Mean, Variance and Asymptotic Property for General Hypergeometric Distribution

Abstract

General hypergeometric distribution (GHGD) definition: from a finite space N containing n elements, randomly select totally T subsets Mi (each contains mi elements, 1 ≥ i ≥ T), what is the probability that exactly x elements are overlapped exactly t times or at least t times (xt or x≥ t)? The GHGD described the distribution of random variables xt and x≥ t. In our previous results, we obtained the formulas of mathematical expectation and variance for special situations (T ≤ 7), and not provided proofs. Here, we completed the exact formulas of mean and variance for xt and x≥ t for any situation, and provided strict mathematical proofs. In addition, we give the asymptotic property of the variables. When the mean approaches to 0, the variance fast approaches to the value of mean, and actually, their difference is a higher order infinitesimal of mean. Therefore, when the mean is small enough (<1), it can be used as a fairly accurate approximation of variance.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…