Universal Bayesian Measures and Universal Histogram Sequences
Abstract
Consider universal data compression: the length l(xn) of sequence xn∈ An with finite alphabet A and length n satisfies Kraft's inequality over An, and -1n Pn(xn)Qn(xn) almost surely converges to zero as n grows for the Qn(xn)=2-l(xn) and any stationary ergodic source P. In this paper, we say such a Q is a universal Bayesian measure. We generalize the notion to the sources in which the random variables may be either discrete, continuous, or none of them. The basic idea is due to Boris Ryabko who utilized model weighting over histograms that approximate P, assuming that a density function of P exists. However, the range of P depends on the choice of the histogram sequence. The universal Bayesian measure constructed in this paper overcomes the drawbacks and has many applications to infer relation among random variables, and extends the application area of the minimum description length principle.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.