A second order regret bound for NormalHedge

Abstract

We consider the problem of prediction with expert advice for ``easy'' sequences. We show that a variant of NormalHedge enjoys a second-order ε-quantile regret bound of O(VT (VT/ε)) when VT > N, where VT is the cumulative second moment of instantaneous per-expert regret averaged with respect to a natural distribution determined by the algorithm. The algorithm is motivated by a continuous time limit using Stochastic Differential Equations. The discrete time analysis uses self-concordance techniques.

0

Discussion (0)

Sign in to join the discussion.

Loading comments…