Gradient-Variation Regret Bounds for Unconstrained Online Learning

Abstract

We develop parameter-free algorithms for unconstrained online learning with regret guarantees that scale with the gradient variation VT(u) = Σt=2T \|∇ ft(u)-∇ ft-1(u)\|2. For L-smooth convex loss, we provide fully-adaptive algorithms achieving regret of order O(\|u\|VT(u) + L\|u\|2+G4) without requiring prior knowledge of comparator norm \|u\|, Lipschitz constant G, or smoothness L. The update in each round can be computed efficiently via a closed-form expression. Our results extend to dynamic regret and find immediate implications to the stochastically-extended adversarial (SEA) model, which significantly improves upon the previous best-known result [Wang et al., 2025].

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…