Universal heavy-ball method for nonconvex optimization under H\"older continuous Hessians
Abstract
We propose a new first-order method for minimizing nonconvex functions with Lipschitz continuous gradients and H\"older continuous Hessians. The proposed algorithm is a heavy-ball method equipped with two particular restart mechanisms. It finds a solution where the gradient norm is less than ε in O(H12 + 2 ε- 4 + 3 2 + 2 ) function and gradient evaluations, where ∈ [0, 1] and H are the H\"older exponent and constant, respectively. Our algorithm is -independent and thus universal; it automatically achieves the above complexity bound with the optimal ∈ [0, 1] without knowledge of H. In addition, the algorithm does not require other problem-dependent parameters as input, including the gradient's Lipschitz constant or the target accuracy ε. Numerical results illustrate that the proposed method is promising.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.