SURE Information Criteria for Large Covariance Matrix Estimation and Their Asymptotic Properties
Abstract
Consider n independent and identically distributed p-dimensional Gaussian random vectors with covariance matrix . The problem of estimating when p is much larger than n has received a lot of attention in recent years. Yet little is known about the information criterion for covariance matrix estimation. How to properly define such a criterion and what are the statistical properties? We attempt to answer these questions in the present paper by focusing on the estimation of bandable covariance matrices when p>n but (p)=o(n). Motivated by the deep connection between Stein's unbiased risk estimation (SURE) and AIC in regression models, we propose a family of generalized SURE (SUREc) indexed by c for covariance matrix estimation, where c is some constant. When c is 2, SURE2 provides an unbiased estimator of the Frobenious risk of the covariance matrix estimator. Furthermore, we show that by minimizing SURE2 over all possible banding covariance matrix estimators we attain the minimax optimal rate of convergence and the resulting estimator behaves like the covariance matrix estimator obtained by the so-called oracle tuning. On the other hand, we also show that SURE2 is selection inconsistent when the true covariance matrix is exactly banded. To fix the selection inconsistency, we consider using SURE with c=(n) and prove that by minimizing SURE(n) we select the true bandwith with probability tending to one. Therefore, our analysis indicates that SURE2 and SURE(n) can be regarded as the AIC and BIC for large covariance matrix estimation, respectively.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.