Optimal Algorithms and Lower Bounds for Testing Closeness of Structured Distributions
Abstract
We give a general unified method that can be used for L1 closeness testing of a wide range of univariate structured distribution families. More specifically, we design a sample optimal and computationally efficient algorithm for testing the equivalence of two unknown (potentially arbitrary) univariate distributions under the Ak-distance metric: Given sample access to distributions with density functions p, q: I R, we want to distinguish between the cases that p=q and \|p-q\|Ak ε with probability at least 2/3. We show that for any k 2, ε>0, the optimal sample complexity of the Ak-closeness testing problem is (\ k4/5/ε6/5, k1/2/ε2 \). This is the first o(k) sample algorithm for this problem, and yields new, simple L1 closeness testers, in most cases with optimal sample complexity, for broad classes of structured distributions.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.