When can a posterior predictive check identify the learning rate? Exact degeneracy in Gaussian models and implications for Generalised Bayesian Inerence

Abstract

Generalised Bayesian inference tempers the likelihood by a learning rate η to mitigate model misspecification, and the choice of η is consequential. Zafar and Nicholls (2024) proposed selecting η by a posterior predictive check (PPC): one chooses the smallest η at which a log-likelihood PPC p-value is not rejected. An exact, finite-sample analysis of this selector on the Gaussian linear model is given. With known variance and a flat prior, the PPC p-value equals P(χ2n > RSS/σ02) for every η, so the selector is η-invariant; under variance misspecification it is two-sided non-identifying. With unknown variance and the reference prior, the p-value depends only on (n,d,η) and not on the realised data or the data-generating process. Consequently the selector's output is fixed before any data are seen, typically collapsing to the smallest grid value, which over-tempers and inflates predictive intervals relative to held-out selection. The phenomenon is a pivotality property specific to the Gaussian scale--location family and the reference prior; it disappears under informative priors. These results delineate the selector's scope, identify a canonical class on which it cannot identify the learning rate, and motivate a cheap, data-free pre-screening diagnostic.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…