Bayesian Posteriors For Arbitrarily Rare Events
Abstract
We study how much data a Bayesian observer needs to correctly infer the relative likelihoods of two events when both events are arbitrarily rare. Each period, either a blue die or a red die is tossed. The two dice land on side 1 with unknown probabilities p1 and q1, which can be arbitrarily low. Given a data-generating process where p1 c q1, we are interested in how much data is required to guarantee that with high probability the observer's Bayesian posterior mean for p1 exceeds (1-δ)c times that for q1. If the prior densities for the two dice are positive on the interior of the parameter space and behave like power functions at the boundary, then for every ε>0, there exists a finite N so that the observer obtains such an inference after n periods with probability at least 1-ε whenever np1 N. The condition on n and p1 is the best possible. The result can fail if one of the prior densities converges to zero exponentially fast at the boundary.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.