Fast Subspace Approximation via Greedy Least-Squares

Abstract

In this note, we develop fast and deterministic dimensionality reduction techniques for a family of subspace approximation problems. Let P⊂ RN be a given set of M points. The techniques developed herein find an O(n M)-dimensional subspace that is guaranteed to always contain a near-best fit n-dimensional hyperplane H for P with respect to the cumulative projection error (Σ x ∈ P \| x - ΠH x \|p2)1/p, for any chosen p > 2. The deterministic algorithm runs in O (MN2)-time, and can be randomized to run in only O (MNn)-time while maintaining its error guarantees with high probability. In the case p = ∞ the dimensionality reduction techniques can be combined with efficient algorithms for computing the John ellipsoid of a data set in order to produce an n-dimensional subspace whose maximum 2-distance to any point in the convex hull of P is minimized. The resulting algorithm remains O (MNn)-time. In addition, the dimensionality reduction techniques developed herein can also be combined with other existing subspace approximation algorithms for 2 < p ≤ ∞ - including more accurate algorithms based on convex programming relaxations - in order to reduce their runtimes.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…