Reciprocal symmetry breaking in Pareto sampling
Abstract
Let W1,…,WN be a sample of Pareto(α) random variables normalized by their sum, such that Σi Wi=1. The Wi may represent the weights of valleys in a spin glass (if 0<α<1), or the frequency of different lineages (families) in a genealogy. This paper considers a population in which there are N individuals reproducing with Pareto(α) offspring-number distribution (1<α<2). The probability of two randomly-chosen individuals being siblings, Y2=Σi Wi2, gives the sample mean of the normalized size of families, and its reciprocal gives the effective number of families (or reproducing lineages) in the population, Ne=1/Y2. The typical sample mean is very different from the average over all possible samples, i.e. Y2 is not a self-averaging quantity. The typical Y2 and its reciprocal do not vary with N in opposite ways. Non-self-averaging effects are crucial in understanding genetic diversity in mass spawning species such as marine fishes.