Inference via robust optimal transportation: theory and methods
Abstract
Optimal transportation theory and the related p-Wasserstein distance (Wp, p≥ 1) are widely-applied in statistics and machine learning. In spite of their popularity, inference based on these tools has some issues. For instance, it is sensitive to outliers and it may not be even defined when the underlying model has infinite moments. To cope with these problems, first we consider a robust version of the primal transportation problem and show that it defines the robust Wasserstein distance, W(λ), depending on a tuning parameter λ > 0. Second, we illustrate the link between W1 and W(λ) and study its key measure theoretic aspects. Third, we derive some concentration inequalities for W(λ). Fourth, we use W(λ) to define minimum distance estimators, we provide their statistical guarantees and we illustrate how to apply the derived concentration inequalities for a data driven selection of λ. Fifth, we provide the dual form of the robust optimal transportation problem and we apply it to machine learning problems (generative adversarial networks and domain adaptation). Numerical exercises provide evidence of the benefits yielded by our novel methods.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.