Improved Approximation Algorithms for Earth-Mover Distance in Data Streams

Abstract

For two multisets S and T of points in []2, such that |S| = |T|= n, the earth-mover distance (EMD) between S and T is the minimum cost of a perfect bipartite matching with edges between points in S and T, i.e., EMD(S,T) = π:S→ TΣa∈ S||a-π(a)||1, where π ranges over all one-to-one mappings. The sketching complexity of approximating earth-mover distance in the two-dimensional grid is mentioned as one of the open problems in the literature. We give two algorithms for computing EMD between two multi-sets when the number of distinct points in one set is a small value k=O(1)( n). Our first algorithm gives a (1+ε)-approximation using O(kε-24n) space and works only in the insertion-only model. The second algorithm gives a O((k3,))-approximation using O(3·· n)-space in the turnstile model.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…