Models for Genetic Diversity Generated by Negative Binomial Point Processes
Abstract
We develop a model based on a generalised Poisson-Dirichlet distribution for the analysis of genetic diversity, and illustrate its use on microsatellite data for the genus Dasyurus (the quoll, a marsupial carnivore listed as near-threatened in Australia). Our class of distributions, termed PDα(r), is constructed from a negative binomial point process, generalizing the usual one-parameter PDα model, which is constructed from a Poisson point process. Both models have at their heart a Stable(α) process, but in PDα(r), an extra parameter r>0 adds flexibility, analogous to the way the negative binomial distribution allows for "overdispersion" in the analysis of count data. A key result obtained is a generalised version of Ewens' sampling formula for PDα(r). We outline the theoretical basis for the model, and, for the quolls data, estimate the parameters α and r by least squares, showing how the extra parameter r aids in the interpretability of the data by comparison with the standard PDα model. The methods potentially have implications for the management and conservation of threatened populations.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.