Approximate Nearest Neighbors in Limited Space

Abstract

We consider the (1+ε)-approximate nearest neighbor search problem: given a set X of n points in a d-dimensional space, build a data structure that, given any query point y, finds a point x ∈ X whose distance to y is at most (1+ε) x ∈ X \|x-y\| for an accuracy parameter ε ∈ (0,1). Our main result is a data structure that occupies only O(ε-2 n (n) (1/ε)) bits of space, assuming all point coordinates are integers in the range \-nO(1) … nO(1)\, i.e., the coordinates have O( n) bits of precision. This improves over the best previously known space bound of O(ε-2 n (n)2), obtained via the randomized dimensionality reduction method of Johnson and Lindenstrauss (1984). We also consider the more general problem of estimating all distances from a collection of query points to all data points X, and provide almost tight upper and lower bounds for the space complexity of this problem.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…