Sharp First-Order Lower Bounds under Sublevel α-Polyak-Lojasiewicz Conditions

Abstract

We study the optimal complexity of first-order methods under the α-Polyak-Lojasiewicz condition with α∈[1,2). This condition bounds the suboptimality gap by a power α of the gradient norm; α=2 recovers the classical Polyak-Lojasiewicz condition, α=1 corresponds to a Holder error-bound regime, and intermediate exponents arise near degenerate minima in local Kurdyka-Lojasiewicz geometry. We first prove a structural obstruction: if global smoothness and a global α-Polyak-Lojasiewicz inequality are imposed on Rd, then every such function is constant for α<2. This motivates the globally smooth, sublevel-α-Polyak-Lojasiewicz class, where the inequality is required only on the initial sublevel set. On this class, we prove sharp minimax lower bounds for first-order methods. In the deterministic oracle model, any first-order method requires Ω(Lτ2/αε-(2-α)/α) queries to reach accuracy ε, matching gradient descent. In the bounded-variance stochastic-gradient oracle model, any stochastic first-order method requires Ω(Lσ2τ4/αε-(4-α)/α) queries in the noise-dominated regime, matching known SGD upper rates under trajectory-containment assumptions.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…