Refined α-Divergence Variational Inference via Rejection Sampling
Abstract
We present an approximate inference method, based on a synergistic combination of R\'enyi α-divergence variational inference (RDVI) and rejection sampling (RS). RDVI is based on minimization of R\'enyi α-divergence Dα(p||q) between the true distribution p(x) and a variational approximation q(x); RS draws samples from a distribution p(x) = p(x)/Zp using a proposal q(x), s.t. Mq(x) ≥ p(x), ∀ x. Our inference method is based on a crucial observation that D∞(p||q) equals M(θ) where M(θ) is the optimal value of the RS constant for a given proposal qθ(x). This enables us to develop a two-stage hybrid inference algorithm. Stage-1 performs RDVI to learn qθ by minimizing an estimator of Dα(p||q), and uses the learned qθ to find an (approximately) optimal M(θ). Stage-2 performs RS using the constant M(θ) to improve the approximate distribution qθ and obtain a sample-based approximation. We prove that this two-stage method allows us to learn considerably more accurate approximations of the target distribution as compared to RDVI. We demonstrate our method's efficacy via several experiments on synthetic and real datasets.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.