On the Randomized Complexity of Minimizing a Convex Quadratic Function

Abstract

Minimizing a convex, quadratic objective of the form fA,b(x) := 12x A x - b, x for A 0 is a fundamental problem in machine learning and optimization. In this work, we prove gradient-query complexity lower bounds for minimizing convex quadratic functions which apply to both deterministic and randomized algorithms. Specifically, for > 1, we exhibit a distribution over (A,b) with condition number cond(A) , such that any randomized algorithm requires () gradient queries to find a solution x for which \| x - x\| ε0\|x\|, where x = A-1b is the optimal solution, and ε0 a small constant. Setting =1/ε, this lower bound implies the minimax rate of T = (λ1(A)\| x\|2/ε) queries required to minimize an arbitrary convex quadratic function up to error f(x) - f( x) ε. Our lower bound holds for a distribution derived from classical ensembles in random matrix theory, and relies on a careful reduction from adaptively estimating a planted vector u in a deformed Wigner model. A key step in deriving sharp lower bounds is demonstrating that the optimization error x - x cannot align too closely with u. To this end, we prove an upper bound on the cosine between x - x and u in terms of the MMSE of estimating the plant u in a deformed Wigner model. We then bound the MMSE by carefully modifying a result due to Lelarge and Miolane 2016, which rigorously establishes a general replica-symmetric formula for planted matrix models.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…