Efficient First Order Method for Saddle Point Problems with Higher Order Smoothness
Abstract
This paper studies the complexity of finding approximate stationary points for the smooth nonconvex-strongly-concave (NC-SC) saddle point problem: xyf(x,y). Under the standard first-order smoothness conditions where f is -smooth in both arguments and μy-strongly concave in y, existing literature shows that the optimal complexity for first-order methods to obtain an ε-stationary point is O(yε-2), where y=/μy is the condition number. However, when (x):=y f(x,y) has L2-Lipschitz continuous Hessian in addition, we derive a first-order algorithm with an O(y1/2L21/4ε-7/4) complexity by designing an accelerated proximal point algorithm enhanced with the "Convex Until Proven Guilty" technique. Moreover, an improved (y3/7L22/7ε-12/7) lower bound for first-order method is also derived for sufficiently small ε. As a result, given the second-order smoothness of the problem, the complexity of our method improves the state-of-the-art result by a factor of O((2L2ε)1/4), while almost matching the lower bound except for a small O((2L2ε)1/28) factor.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.