Optimal Algorithms and Lower Bounds for Testing Closeness of Structured Distributions

Abstract

We give a general unified method that can be used for L1 closeness testing of a wide range of univariate structured distribution families. More specifically, we design a sample optimal and computationally efficient algorithm for testing the equivalence of two unknown (potentially arbitrary) univariate distributions under the Ak-distance metric: Given sample access to distributions with density functions p, q: I R, we want to distinguish between the cases that p=q and \|p-q\|Ak ε with probability at least 2/3. We show that for any k 2, ε>0, the optimal sample complexity of the Ak-closeness testing problem is (\ k4/5/ε6/5, k1/2/ε2 \). This is the first o(k) sample algorithm for this problem, and yields new, simple L1 closeness testers, in most cases with optimal sample complexity, for broad classes of structured distributions.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…