Thinking Inside the Ball: Near-Optimal Minimization of the Maximal Loss
Abstract
We characterize the complexity of minimizing i∈[N] fi(x) for convex, Lipschitz functions f1,…, fN. For non-smooth functions, existing methods require O(Nε-2) queries to a first-order oracle to compute an ε-suboptimal point and O(Nε-1) queries if the fi are O(1/ε)-smooth. We develop methods with improved complexity bounds of O(Nε-2/3 + ε-8/3) in the non-smooth case and O(Nε-2/3 + Nε-1) in the O(1/ε)-smooth case. Our methods consist of a recently proposed ball optimization oracle acceleration algorithm (which we refine) and a careful implementation of said oracle for the softmax function. We also prove an oracle complexity lower bound scaling as (Nε-2/3), showing that our dependence on N is optimal up to polylogarithmic factors.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.