k-Sample inference via Multimarginal Optimal Transport
Abstract
This paper proposes a Multimarginal Optimal Transport (MOT) approach for simultaneously comparing k≥ 2 measures supported on finite subsets of Rd, d ≥ 1. We derive asymptotic distributions of the optimal value of the empirical MOT program under the null hypothesis that all k measures are same, and the alternative hypothesis that at least two measures are different. We use these results to construct the test of the null hypothesis and provide consistency and power guarantees of this k-sample test. We consistently estimate asymptotic distributions using bootstrap, and propose a low complexity linear program to approximate the test cut-off. We demonstrate the advantages of our approach on synthetic and real datasets, including the real data on cancers in the United States in 2004 - 2020.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.