Pairwise Multi-marginal Optimal Transport and Embedding for Earth Mover's Distance
Abstract
We investigate the problem of pairwise multi-marginal optimal transport, that is, given a collection of probability distributions \Pα\ on a Polish space X, to find a coupling \Xα\, Xα Pα, such that E[c(Xα,Xβ)] r∈fX Pα,Y PβE[c(X,Y)] for all α,β, where c is a cost function and r1. In other words, every pair (Xα,Xβ) has an expected cost at most a factor of r from its lowest possible value. This can be regarded as a locality sensitive hash function for probability distributions, and has applications such as robust and distributed computation of transport plans. It can also be considered as a bi-Lipschitz embedding of the collection of probability distributions into the space of random variables taking values on X. For c(x,y)= x-y2q on Rn, where q>0, we show that a finite r is attainable if and only if either n=1 or 0<q<1. As n∞, the growth rate of the smallest possible r is exactly (nq/2) if 0<q<1. Hence, the metric space of probability distributions on Rn with finite q-th absolute moments, 0<q<1, with the earth mover's distance (or 1-Wasserstein distance) with respect to the snowflake metric c(x,y)= x-y2q, is bi-Lipschitz embeddable into L1 with distortion O(nq/2). If we consider c(x,y)= x-y2 (i.e., q=1) on the grid [0..s]n instead of Rn, then r=O(n s) is attainable, which implies the embeddability of the space of probability distributions on [0..s]n into L1 with distortion O(n s), and improves upon the O(n s) result by Indyk and Thaper. The case of the discrete metric cost c(x,y)=1\x≠ y\ and more general metric and ultrametric costs are also investigated.