Detection of the number of principal components by extended AIC-type method
Abstract
Estimating the number of principal components is one of the fundamental problems in many scientific fields such as signal processing (or the spiked covariance model). In this paper, we first demonstrate that, for fixed p, any penalty term of the form k'(p-k'/2+1/2)Cn may lead to an asymptotically consistent estimator under the condition that Cn∞ and Cn/n0. We also extend our results to the case n,p∞, with p/n c>0. In this case, for k=o(n13), we first investigate the limiting laws for the leading eigenvalues of the sample covariance matrix Sn under the condition that λk>1+c. At low SNR, since the AIC tends to underestimate the number of signals k, the AIC should be re-defined in this case. As a natural extension of the AIC for fixed p, we propose the extended AIC (EAIC), i.e., the AIC-type method with tuning parameter γ=(c)=1/2+1/c-(1+c)/c, and demonstrate that the EAIC-type method, i.e., the AIC-type method with tuning parameter γ>(c), can select the number of signals k consistently. In the following two cases, (1) p fixed, n∞, (2) n,p∞ with p/n 0, if the AIC is defined as the degeneration of the EAIC in the case n,p∞ with p/n c>0, i.e., γ=c→ 0+0(c)=1, then we have essentially demonstrated that, to achieve the consistency of the AIC-type method in the above two cases, γ>1 is required. Moreover, we show that the EAIC-type method is essentially tuning-free and outperforms the well-known KN estimator proposed in Kritchman and Nadler (2008) and the BCF estimator proposed in Bai, Choi and Fujikoshi (2018). Numerical studies indicate that the proposed method works well.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.