Using Variational Inference to Improve the Efficiency of MCMC Algorithms

Abstract

Bayesian statistics makes inference based on Bayes' theorem, but the posterior distribution of unknown parameters is typically analytically intractable. To estimate the posterior, two widely used numerical approximation methods are Markov Chain Monte Carlo (MCMC) and variational inference (VI). MCMC methods produce asymptotically exact samples but are computationally intensive, while VI methods are faster and more scalable but may lack accuracy. This paper proposes combining MCMC and VI to construct algorithms that leverage the strengths of both. The first proposed algorithm uses Gaussian variational inference (GVI) with various covariance structures to derive a linear transformation matrix for Hamiltonian Monte Carlo (HMC). This method improves the efficiency of HMC, particularly in high-dimensional and complex target distributions. The second algorithm combines a VI-based generative model, the variational auto-encoder (VAE), with the Metropolis-Hastings (MH) sampler. The resulting VAE-MH sampler is efficient and effectively traverses the parameter space, outperforming standard MCMC methods in identifying all modes of multi-modal distributions.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…