Optimal Sketching for Kronecker Product Regression and Low Rank Approximation
Abstract
We study the Kronecker product regression problem, in which the design matrix is a Kronecker product of two or more matrices. Given Ai ∈ Rni × di for i=1,2,…,q where ni di for each i, and b ∈ Rn1 n2 ·s nq, let A = A1 A2 ·s Aq. Then for p ∈ [1,2], the goal is to find x ∈ Rd1 ·s dq that approximately minimizes \|Ax - b\|p. Recently, Diao, Song, Sun, and Woodruff (AISTATS, 2018) gave an algorithm which is faster than forming the Kronecker product A Specifically, for p=2 their running time is O(Σi=1q nnz(Ai) + nnz(b)), where nnz(Ai) is the number of non-zero entries in Ai. Note that nnz(b) can be as large as n1 ·s nq. For p=1, q=2 and n1 = n2, they achieve a worse bound of O(n13/2 poly(d1d2) + nnz(b)). In this work, we provide significantly faster algorithms. For p=2, our running time is O(Σi=1q nnz(Ai) ), which has no dependence on nnz(b). For p<2, our running time is O(Σi=1q nnz(Ai) + nnz(b)), which matches the prior best running time for p=2. We also consider the related all-pairs regression problem, where given A ∈ Rn × d, b ∈ Rn, we want to solve x \|Ax - b\|p, where A ∈ Rn2 × d, b ∈ Rn2 consist of all pairwise differences of the rows of A,b. We give an O(nnz(A)) time algorithm for p ∈[1,2], improving the (n2) time needed to form A. Finally, we initiate the study of Kronecker product low rank and low t-rank approximation. For input A as above, we give O(Σi=1q nnz(Ai)) time algorithms, which is much faster than computing A.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.