Statistical Mechanics of Inference

Abstract

Statistical modeling often involves identifying an optimal estimate to some underlying probability distribution known to satisfy some given constraints. I show here that choosing as estimate the centroid, or center of mass, of the set consistent with the constraints formally minimizes an objective measure of the expected error. Further, I obtain a useful approximation to this point, valid in the thermodynamic limit, that immediately provides much information relating to the full solution set's geometry. For weak constraints, the centroid is close to the popular maximum entropy solution, whereas for strong constraints the two are far apart. Because of this, centroid inference is often substantially more accurate. The results I present allow for its straightforward application.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…