Comparing weighted difference and earth mover's distance via Young diagrams
Abstract
We consider two natural statistics on pairs of histograms, in which the n bins have weights 0, …, n-1. The difference (D) between the weighted totals of the histograms is, in a sense, refined by the earth mover's distance (EMD), which measures the amount of work required to equalize the histograms. We were recently surprised, however, by how little EMD actually does refine D in certain real-world applications, which led to the main problem in this paper: what is the probability that EMD = |D|? We derive a formula for this probability, as well as the expected value of |D|, via the combinatorics of Young diagrams and plane partitions. We then generalize our results to an arbitrary number of histograms, where we realize this higher-dimensional D as distance on the Type-A root lattice.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.