Fast minimization of structured convex quartics
Abstract
We propose faster methods for unconstrained optimization of structured convex quartics, which are convex functions of the form equation* f(x) = c x + x G x + T[x,x,x] + 124 \| A x \|44 equation* for c ∈ Rd, G ∈ Rd × d, T ∈ Rd × d × d, and A ∈ Rn × d such that A A 0. In particular, we show how to achieve an ε-optimal minimizer for such functions with only O(n1/5O(1)(Z/ε)) calls to a gradient oracle and linear system solver, where Z is a problem-dependent parameter. Our work extends recent ideas on efficient tensor methods and higher-order acceleration techniques to develop a descent method for optimizing the relevant quartic functions. As a natural consequence of our method, we achieve an overall cost of O(n1/5O(1)(Z / ε)) calls to a gradient oracle and (sparse) linear system solver for the problem of 4-regression when A A 0, providing additional insight into what may be achieved for general p-regression. Our results show the benefit of combining efficient higher-order methods with recent acceleration techniques for improving convergence rates in fundamental convex optimization problems.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.