Nearly Consistent Finite Particle Estimates in Streaming Importance Sampling

Abstract

In Bayesian inference, we seek to compute information about random variables such as moments or quantiles on the basis of available data and prior information. When the distribution of random variables is intractable, Monte Carlo (MC) sampling is usually required. Importance sampling is a standard MC tool that approximates this unavailable distribution with a set of weighted samples. This procedure is asymptotically consistent as the number of MC samples (particles) go to infinity. However, retaining infinitely many particles is intractable. Thus, we propose a way to only keep a finite representative subset of particles and their augmented importance weights that is nearly consistent. To do so in an online manner, we (1) embed the posterior density estimate in a reproducing kernel Hilbert space (RKHS) through its kernel mean embedding; and (2) sequentially project this RKHS element onto a lower-dimensional subspace in RKHS using the maximum mean discrepancy, an integral probability metric. Theoretically, we establish that this scheme results in a bias determined by a compression parameter, which yields a tunable tradeoff between consistency and memory. In experiments, we observe the compressed estimates achieve comparable performance to the dense ones with substantial reductions in representational complexity.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…