Bayes factors and the geometry of discrete hierarchical loglinear models
Abstract
A standard tool for model selection in a Bayesian framework is the Bayes factor which compares the marginal likelihood of the data under two given different models. In this paper, we consider the class of hierarchical loglinear models for discrete data given under the form of a contingency table with multinomial sampling. We assume that the Diaconis-Ylvisaker conjugate prior is the prior distribution on the loglinear parameters and the uniform is the prior distribution on the space of models. Under these conditions, the Bayes factor between two models is a function of their prior and posterior normalizing constants. These constants are functions of the hyperparameters (m,α) which can be interpreted respectively as marginal counts and the total count of a fictive contingency table. We study the behaviour of the Bayes factor when α tends to zero. In this study two mathematical objects play a most important role. They are, first, the interior C of the convex hull C of the support of the multinomial distribution for a given hierarchical loglinear model together with its faces and second, the characteristic function JC of this convex set C. We show that, when α tends to 0, if the data lies on a face Fi of Ci,i=1,2 of dimension ki, the Bayes factor behaves like αk1-k2. This implies in particular that when the data is in C1 and in C2, i.e. when ki equals the dimension of model Ji, the sparser model is favored, thus confirming the idea of Bayesian regularization.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.