Affine Invariant Covariance Estimation for Heavy-Tailed Distributions

Abstract

In this work we provide an estimator for the covariance matrix of a heavy-tailed multivariate distributionWe prove that the proposed estimator S admits an affine-invariant bound of the form \[(1-) S S (1+) S\]in high probability, where S is the unknown covariance matrix, and is the positive semidefinite order on symmetric matrices. The result only requires the existence of fourth-order moments, and allows for = O(4 d(d/δ)/n) where 4 is a measure of kurtosis of the distribution, d is the dimensionality of the space, n is the sample size, and 1-δ is the desired confidence level. More generally, we can allow for regularization with level λ, then d gets replaced with the degrees of freedom number. Denoting cond(S) the condition number of S, the computational cost of the novel estimator is O(d2 n + d3(cond(S))), which is comparable to the cost of the sample covariance estimator in the statistically interesing regime n d. We consider applications of our estimator to eigenvalue estimation with relative error, and to ridge regression with heavy-tailed random design.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…