From Average Sensitivity to Small-Loss Regret Bounds under Random-Order Model
Abstract
We study online learning in the random-order model, where the multiset of loss functions is chosen adversarially but revealed in a uniformly random order. By extending the batch-to-online transformation of Dong and Yoshida (2023), we show that if an offline algorithm enjoys a (1+)-approximation guarantee, an average sensitivity bound controlled by a function (), and stability with respect to , then we can obtain a small-loss regret bound typically of order O((OPTT)), where is the concave conjugate of , OPTT is the offline optimum over T rounds, and O hides polylogarithmic factors in T. Our result refines their original (1+)-approximate regret guarantee and applies to a broad class of problems, including online k-means clustering and online low-rank approximation. We further apply our approach to online submodular function minimization using (1)-cut sparsifiers of submodular hypergraphs, obtaining a small-loss regret bound of O(n3 + n3/4OPTT3/4), where n is the ground-set size; we also demonstrate its applicability to online 1 regression. Our work sheds light on the power of sparsification and related algorithmic techniques in achieving small-loss regret bounds in the random-order model, without requiring structural assumptions on loss functions, such as linearity or smoothness.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.