Minimax Estimation of the $L_1$ Distance

Tsachy Weissman

doi:10.1109/TIT.2018.2846245


Abstract

We consider the problem of estimating the $L_1$ distance between two discrete probability measures P and Q from empirical data in a nonasymptotic and large alphabet setting. When Q is known and one obtains n samples from P, we show that for every Q, the minimax rate-optimal estimator with n samples achieves performance comparable to that of the maximum likelihood estimator (MLE) with n ln n samples. When both P and Q are unknown, we construct minimax rate-optimal estimators whose worst-case performance is essentially that of the known Q case with Q being uniform, implying that Q being uniform is essentially the most difficult case. The effective sample size enlargement phenomenon, identified in Jiao et al. (2015), holds both in the known Q case for every Q and in the unknown Q case. However, the construction of optimal estimators for $\|P-Q\|_1$ requires new techniques and insights beyond the approximation-based method of functional estimation in Jiao et al. (2015).
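For orientation, the MLE baseline that the abstract compares against can be sketched as a plug-in estimator: replace each unknown distribution by its empirical frequencies and compute the $L_1$ distance directly. The sketch below is only that baseline, not the minimax rate-optimal estimator constructed in the paper. The function names are hypothetical, and it assumes symbols are encoded as integers 0, ..., S-1 for an alphabet of size S.

```python
from collections import Counter


def empirical_distribution(samples, alphabet_size):
    """Empirical frequencies of integer symbols 0..alphabet_size-1 (assumed encoding)."""
    if not samples:
        raise ValueError("need at least one sample")
    counts = Counter(samples)
    n = len(samples)
    return [counts.get(symbol, 0) / n for symbol in range(alphabet_size)]


def plugin_l1_known_q(samples_p, q):
    """Plug-in (MLE) estimate of ||P - Q||_1 when Q is known exactly."""
    p_hat = empirical_distribution(samples_p, len(q))
    return sum(abs(p_i - q_i) for p_i, q_i in zip(p_hat, q))


def plugin_l1_unknown_q(samples_p, samples_q, alphabet_size):
    """Plug-in (MLE) estimate of ||P - Q||_1 when both P and Q are only sampled."""
    p_hat = empirical_distribution(samples_p, alphabet_size)
    q_hat = empirical_distribution(samples_q, alphabet_size)
    return sum(abs(p_i - q_i) for p_i, q_i in zip(p_hat, q_hat))
```

For example, `plugin_l1_known_q([0, 0, 0, 0], [0.5, 0.5])` compares the empirical distribution (1, 0) with the uniform distribution on two symbols. The plug-in approach is simple but can be badly biased when the alphabet is large relative to n, which is the regime the abstract addresses.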
