Polyhedral Instability Governs Regret in Online Learning

Abstract

Many online decision problems over combinatorial actions are addressed via convex relaxations, leading to online convex optimization with piecewise linear objectives and induced polyhedral structure. We show that regret in such problems is governed by polyhedral instability: the number of changes of the active region. Under full information feedback and fixed partition assumptions, if RST denotes the number of region switches and V the maximum number of vertices per region, we prove T= Θ((1+RST)\,T\, V) interpolating between experts-like and dimension-dependent OCO rates. For online submodular--concave games under Lovász convexification, this reduces to the permutation-switch count SCT, yielding the matching rate T= Θ((1+SCT)\,T\, n). Experiments on synthetic and real combinatorial problems (shortest path, influence maximization) validate the predicted scaling and indicate that low-instability regimes can arise in practice without explicit enumeration of actions.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…