Collision-based Testers are Optimal for Uniformity and Closeness

Abstract

We study the fundamental problems of (i) uniformity testing of a discrete distribution, and (ii) closeness testing between two discrete distributions with bounded 2-norm. These problems have been extensively studied in distribution testing and sample-optimal estimators are known for them~Paninski:08, CDVV14, VV14, DKN:15. In this work, we show that the original collision-based testers proposed for these problems ~GRdist:00, BFR+:00 are sample-optimal, up to constant factors. Previous analyses showed sample complexity upper bounds for these testers that are optimal as a function of the domain size n, but suboptimal by polynomial factors in the error parameter ε. Our main contribution is a new tight analysis establishing that these collision-based testers are information-theoretically optimal, up to constant factors, both in the dependence on n and in the dependence on ε.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…