Parameter estimation for integer-valued Gibbs distributions

Abstract

A central problem in computational statistics is to convert a procedure for sampling combinatorial from an objects into a procedure for counting those objects, and vice versa. Weconsider sampling problems coming from *Gibbs distributions*, which are probability distributions of the form μβ(ω) eβ H(ω) for β in an interval [β, β] and H( ω ) ∈ \0 \ [1, n]. The *partition function* is the normalization factor Z(β)=Σω ∈eβ H(ω). Two important parameters are the log partition ratio q = Z(β)Z(β) and the vector of counts cx = |H-1(x)|. Our first result is an algorithm to estimate the counts cx using roughly O( qε2) samples for general Gibbs distributions and O( n2ε2 ) samples for integer-valued distributions (ignoring some second-order terms and parameters). We show this is optimal up to logarithmic factors. We illustrate with improved algorithms for counting connected subgraphs and perfect matchings in a graph. We develop a key subroutine for global estimation of the partition function. Specifically, we produce a data structure to estimate Z(β) for all values β, without further samples. Constructing the data structure requires O(q nε2) samples for general Gibbs distributions and O(n2 nε2 + n q) samples for integer-valued distributions. This improves over a prior algorithm of Kolmogorov (2018) which computes the single point estimate Z(β) using O(qε2) samples. We also show that this complexity is optimal as a function of n and q up to logarithmic terms.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…