Frequent Directions: Simple and Deterministic Matrix Sketching

Abstract

We describe a new algorithm called Frequent Directions for deterministic matrix sketching in the row-updates model. The algorithm is presented an arbitrary input matrix $A \in \mathbb{R}^{n \times d}$ one row at a time. It performs $O(d\ell)$ operations per row and maintains a sketch matrix $B \in \mathbb{R}^{\ell \times d}$ such that for any $k < \ell$, $\|A^T A - B^T B\|_2 \le \|A - A_k\|_F^2 / (\ell - k)$ and $\|A - \pi_{B_k}(A)\|_F^2 \le \left(1 + \frac{k}{\ell - k}\right) \|A - A_k\|_F^2$. Here, $A_k$ stands for the minimizer of $\|A - A_k\|_F$ over all rank $k$ matrices (similarly $B_k$), and $\pi_{B_k}(A)$ is the rank $k$ matrix resulting from projecting $A$ on the row span of $B_k$. We show both of these bounds are the best possible for the space allowed. The summary is mergeable, and hence trivially parallelizable. Moreover, Frequent Directions outperforms exemplar implementations of existing streaming algorithms in the space-error tradeoff.
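To make the first guarantee concrete, here is a minimal NumPy sketch of a row-streaming sketch of this kind. It is an illustration under stated assumptions, not the authors' reference implementation: the function names `frequent_directions` and `_shrink`, the use of a 2ℓ-row working buffer (so that an SVD is needed only about once per ℓ inserted rows), and the final shrink that trims the result to ℓ rows are choices made here and are not described in the abstract.

```python
import numpy as np


def _shrink(B, ell):
    """Subtract the ell-th largest squared singular value from all of them.

    After this step at most ell - 1 rows of the result are nonzero.
    """
    _, s, Vt = np.linalg.svd(B, full_matrices=False)
    delta = s[ell - 1] ** 2                      # ell-th largest, 0-indexed
    shrunk = np.sqrt(np.maximum(s ** 2 - delta, 0.0))
    out = np.zeros_like(B)
    out[: ell - 1] = shrunk[: ell - 1, None] * Vt[: ell - 1]
    return out


def frequent_directions(A, ell):
    """Return an (ell x d) sketch B of the (n x d) matrix A, read row by row.

    Assumption of this sketch: 1 <= ell <= d, so the SVD always yields
    at least ell singular values.
    """
    A = np.asarray(A, dtype=float)
    _, d = A.shape
    if not 1 <= ell <= d:
        raise ValueError("this sketch assumes 1 <= ell <= d")

    B = np.zeros((2 * ell, d))   # working buffer with 2*ell rows
    next_free = 0                # index of the next all-zero row
    for row in A:
        if next_free == 2 * ell:         # buffer full: shrink to free rows
            B = _shrink(B, ell)
            next_free = ell - 1
        B[next_free] = row
        next_free += 1

    # Final shrink so the returned sketch has exactly ell rows.
    return _shrink(B, ell)[:ell]
```

A typical use would call `B = frequent_directions(A, ell)` and compare `np.linalg.norm(A.T @ A - B.T @ B, 2)` against $\|A - A_k\|_F^2 / (\ell - k)$, where the tail energy $\|A - A_k\|_F^2$ is the sum of the squared singular values of $A$ beyond the $k$-th.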
