Bandit optimisation of functions in the Mat\'ern kernel RKHS

Abstract

We consider the problem of optimising functions in the reproducing kernel Hilbert space (RKHS) of a Mat\'ern kernel with smoothness parameter over the domain [0,1]d under noisy bandit feedback. Our contribution, the π-GP-UCB algorithm, is the first practical approach with guaranteed sublinear regret for all >1 and d ≥ 1. Empirical validation suggests better performance and drastically improved computational scalablity compared with its predecessor, Improved GP-UCB.

0

Discussion (0)

Sign in to join the discussion.

Loading comments…