Asymptotic Joint Distribution of Extreme Sample Eigenvalues and Eigenvectors in the Spiked Population Model
Abstract
In this paper, we consider a data matrix XN∈RN× p where all the rows are i.i.d. samples in Rp of mean zero and covariance matrix ∈Rp× p. Here the population matrix is of finite rank perturbation of the identity matrix. This is the "spiked population model" first proposed by Johnstone. As N, p∞ but N/p γ∈(1, ∞), for the sample covariance matrix SN := XNXNT/N, we establish the joint distribution of the largest and the smallest few packs of eigenvalues. Inside each pack, they will behave the same as the eigenvalues drawn from a Gaussian matrix of the corresponding size. Among different packs, we also calculate the covariance between the Gaussian matrices entries. As a corollary, if all the rows of the data matrix are Gaussian, then these packs will be asymptotically independent. Also, the asymptotic behavior of sample eigenvectors are obtained. Their local fluctuation is also Gaussian with covariance explicitly calculated.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.