Variance Breakdown of Huber (M)-estimators: n/p → m ∈ (1,∞)

Abstract

A half century ago, Huber evaluated the minimax asymptotic variance in scalar location estimation, F ∈ Fε V(, F) = 1I(Fε*) , where V(,F) denotes the asymptotic variance of the (M)-estimator for location with score function , and I(Fε*) is the minimal Fisher information Fε I(F) over the class of ε-Contaminated Normal distributions. We consider the linear regression model Y = Xθ0 + W, Wii.i.d.F, and iid Normal predictors Xi,j, working in the high-dimensional-limit asymptotic where the number n of observations and p of variables both grow large, while n/p → m ∈ (1,∞); hence m plays the role of `asymptotic number of observations per parameter estimated'. Let Vm(,F) denote the per-coordinate asymptotic variance of the (M)-estimator of regression in the n/p → m regime. Then Vm ≠ V; however Vm → V as m → ∞. In this paper we evaluate the minimax asymptotic variance of the Huber (M)-estimate. The statistician minimizes over the family (λ)λ > 0 of all tunings of Huber (M)-estimates of regression, and Nature maximizes over gross-error contaminations F ∈ Fε. Suppose that I(Fε*) · m > 1. Then λ F ∈ Fε Vm(λ, F) = 1I(Fε*) - 1/m . Strikingly, if I(Fε*) · m ≤ 1, then the minimax asymptotic variance is +∞. The breakdown point is where the Fisher information per parameter equals unity.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…