Adaptive Rate of Convergence of Thompson Sampling for Gaussian Process Optimization

Souvik Ghosh

Adaptive Rate of Convergence of Thompson Sampling for Gaussian Process Optimization

Abstract

We consider the problem of global optimization of a function over a continuous domain. In our setup, we can evaluate the function sequentially at points of our choice and the evaluations are noisy. We frame it as a continuum-armed bandit problem with a Gaussian Process prior on the function. In this regime, most algorithms have been developed to minimize some form of regret. In this paper, we study the convergence of the sequential point xt to the global optimizer x* for the Thompson Sampling approach. Under some assumptions and regularity conditions, we prove concentration bounds for xt where the probability that xt is bounded away from x* decays exponentially fast in t. Moreover, the result allows us to derive adaptive convergence rates depending on the function structure.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Or compile a full topic from this idea

Discussion (0)

Sign in to join the discussion.

Loading comments…