Accelerated Methods for Non-Convex Optimization
Abstract
We present an accelerated gradient method for non-convex optimization problems with Lipschitz continuous first and second derivatives. The method requires time O(ε-7/4 (1/ ε) ) to find an ε-stationary point, meaning a point x such that \|∇ f(x)\| ε. The method improves upon the O(ε-2 ) complexity of gradient descent and provides the additional second-order guarantee that ∇2 f(x) -O(ε1/2)I for the computed x. Furthermore, our method is Hessian free, i.e. it only requires gradient computations, and is therefore suitable for large scale applications.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.