A Random Matrix--Theoretic Approach to Handling Singular Covariance Estimates
Abstract
In many practical situations we would like to estimate the covariance matrix of a set of variables from an insufficient amount of data. More specifically, if we have a set of N independent, identically distributed measurements of an M dimensional random vector the maximum likelihood estimate is the sample covariance matrix. Here we consider the case where N<M such that this estimate is singular and therefore fundamentally bad. We present a radically new approach to deal with this situation. Let X be the M× N data matrix, where the columns are the N independent realizations of the random vector with covariance matrix . Without loss of generality, we can assume that the random variables have zero mean. We would like to estimate from X. Let K be the classical sample covariance matrix. Fix a parameter 1≤ L≤ N and consider an ensemble of L× M random unitary matrices, \\, having Haar probability measure. Pre and post multiply K by , and by the conjugate transpose of respectively, to produce a non--singular L× L reduced dimension covariance estimate. A new estimate for , denoted by covL(K), is obtained by a) projecting the reduced covariance estimate out (to M× M) through pre and post multiplication by the conjugate transpose of , and by respectively, and b) taking the expectation over the unitary ensemble. Another new estimate (this time for -1), invcovL(K), is obtained by a) inverting the reduced covariance estimate, b) projecting the inverse out (to M× M) through pre and post multiplication by the conjugate transpose of , and by respectively, and c) taking the expectation over the unitary ensemble. We have a closed analytical expression for invcovL(K) and covL(K) in terms of its eigenvalue decomposition.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.