Variational Orthogonal Features

Abstract

Sparse stochastic variational inference allows Gaussian process models to be applied to large datasets. The per iteration computational cost of inference with this method is O(NM2+M3), where N is the number of points in a minibatch and M is the number of `inducing features', which determine the expressiveness of the variational family. Several recent works have shown that for certain priors, features can be defined that remove the O(M3) cost of computing a minibatch estimate of an evidence lower bound (ELBO). This represents a significant computational savings when M N. We present a construction of features for any stationary prior kernel that allow for computation of an unbiased estimator to the ELBO using T Monte Carlo samples in O(NT+M2T) and in O(NT+MT) with an additional approximation. We analyze the impact of this additional approximation on inference quality.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…