Faster Stochastic Algorithms for Minimax Optimization under Polyak--{\L}ojasiewicz Conditions

Luo Luo

Faster Stochastic Algorithms for Minimax Optimization under Polyak--ojasiewicz Conditions

Abstract

This paper considers stochastic first-order algorithms for minimax optimization under Polyak--ojasiewicz (PL) conditions. We propose SPIDER-GDA for solving the finite-sum problem of the form x y f(x,y) 1n Σi=1n fi(x,y), where the objective function f(x,y) is μx-PL in x and μy-PL in y; and each fi(x,y) is L-smooth. We prove SPIDER-GDA could find an ε-optimal solution within O((n + n\,xy2) (1/ε)) stochastic first-order oracle (SFO) complexity, which is better than the state-of-the-art method whose SFO upper bound is O((n + n2/3xy2) (1/ε)), where x L/μx and y L/μy. For the ill-conditioned case, we provide an accelerated algorithm to reduce the computational cost further. It achieves O((n+n\,xy) (y/ε) (1/ε)) SFO upper bound when y n. Our ideas can also be applied to a more general setting where the objective function only satisfies the PL condition for one variable. Numerical experiments validate the superiority of proposed methods.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Or compile a full topic from this idea

Discussion (0)

Sign in to join the discussion.

Loading comments…