Faster Newton Methods for Convex and Nonconvex Optimization in Gradient Complexity
Abstract
Second-order optimization methods are computationally expensive for large-scale problems. Recently, Doikov, Chayti, and Jaggi (ICML 2023) proposed the LazyCRN method that reduces computation by studying the gradient complexity of second-order methods. Their method can achieve a gradient complexity of O( d + d1/2 ε-3/2) and O( d + d1/2 ε-1/2) for nonconvex and convex optimization, respectively, where d is the effective dimension and ε is the target precision. Very recently, Adil, Bullins, Sidford, and Zhang (NeurIPS 2025) improved the gradient complexity to O( d + d1/3 ε-3/2 18 ε-1) for nonconvex optimization. However, the tightness of these methods remains open. In this work, we propose new methods that achieve an improved complexity of O( d + d1/3 ε-3/2) and O( (d + d13/21 ε-2/7) d) for nonconvex and convex optimization, respectively, improving best-known results for both setups.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.