On Using Toeplitz and Circulant Matrices for Johnson-Lindenstrauss Transforms

Abstract

The Johnson-Lindenstrauss lemma is one of the corner stone results in dimensionality reduction. It says that given N, for any set of N vectors X ⊂ Rn, there exists a mapping f : X Rm such that f(X) preserves all pairwise distances between vectors in X to within (1 ) if m = O(-2 N). Much effort has gone into developing fast embedding algorithms, with the Fast Johnson-Lindenstrauss transform of Ailon and Chazelle being one of the most well-known techniques. The current fastest algorithm that yields the optimal m = O(-2 N) dimensions has an embedding time of O(n n + -2 3 N). An exciting approach towards improving this, due to Hinrichs and Vyb\'iral, is to use a random m × n Toeplitz matrix for the embedding. Using Fast Fourier Transform, the embedding of a vector can then be computed in O(n m) time. The big question is of course whether m = O(-2 N) dimensions suffice for this technique. If so, this would end a decades long quest to obtain faster and faster Johnson-Lindenstrauss transforms. The current best analysis of the embedding of Hinrichs and Vyb\'iral shows that m = O(-22 N) dimensions suffices. The main result of this paper, is a proof that this analysis unfortunately cannot be tightened any further, i.e., there exists a set of N vectors requiring m = (-2 2 N) for the Toeplitz approach to work.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…