Optimality of the Subgradient Algorithm in the Stochastic Setting

Douglas Leith

Optimality of the Subgradient Algorithm in the Stochastic Setting

Abstract

We show that the Subgradient algorithm is universal for online learning on the simplex in the sense that it simultaneously achieves O( N) regret for adversarial costs and O(1) pseudo-regret for i.i.d costs. To the best of our knowledge this is the first demonstration of a universal algorithm on the simplex that is not a variant of Hedge. Since Subgradient is a popular and widely used algorithm our results have immediate broad application.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Or compile a full topic from this idea

Discussion (0)

Sign in to join the discussion.

Loading comments…