Robust subgaussian estimation of a mean vector in nearly linear time

Abstract

We construct an algorithm, running in time O(N d + uK d), which is robust to outliers and heavy-tailed data and which achieves the subgaussian rate from [Lugosi, Mendelson] equationeq:introsubgausrate Tr()N+||||opKN equationwith probability at least 1-(-c0K)-(-c1 u) where is the covariance matrix of the informative data, K∈\1, …, K\ is some parameter (number of block means) and u>0 is another parameter of the algorithm. This rate is achieved when K≥ c1 | O| where | O| is the number of outliers in the database and under the only assumption that the informative data have a second moment. The algorithm is fully data-dependent and does not use in its construction the proportion of outliers nor the rate above. Its construction combines recently developed tools for Median-of-Means estimators and covering-Semi-definite Programming [Chen, Diakonikolas, Ge] and [Peng, Tangwongsan, Zhang].

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…