Robust Estimation of Covariance Matrices: Adversarial Contamination and Beyond

Abstract

We consider the problem of estimating the covariance structure of a random vector Y∈ Rd from a sample Y1,…,Yn. We are interested in the situation when d is large compared to n but the covariance matrix of interest has (exactly or approximately) low rank. We assume that the given sample is (a) ε-adversarially corrupted, meaning that ε fraction of the observations could have been replaced by arbitrary vectors, or that (b) the sample is i.i.d. but the underlying distribution is heavy-tailed, meaning that the norm of Y possesses only finite fourth moments. We propose an estimator that is adaptive to the potential low-rank structure of the covariance matrix as well as to the proportion of contaminated data, and admits tight deviation guarantees despite rather weak assumptions on the underlying distribution. Finally, we discuss the algorithms that allow to approximate the proposed estimator in a numerically efficient way.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…