Edgeworth correction for the largest eigenvalue in a spiked PCA model

Iain M. Johnstone

Abstract

We study improved approximations to the distribution of the largest eigenvalue $\hat{\ell}$ of the sample covariance matrix of n zero-mean Gaussian observations in dimension p+1. We assume that one population principal component has variance $\ell > 1$ and the remaining 'noise' components have common variance 1. In the high dimensional limit $p/n \to \gamma > 0$, we begin the study of Edgeworth corrections to the limiting Gaussian distribution of $\hat{\ell}$ in the supercritical case $\ell > 1 + \sqrt{\gamma}$. The skewness correction involves a quadratic polynomial as in classical settings, but the coefficients reflect the high dimensional structure. The methods involve Edgeworth expansions for sums of independent non-identically distributed variates obtained by conditioning on the sample noise eigenvalues, and limiting bulk properties and fluctuations of these noise eigenvalues.
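The setting described in the abstract can be explored numerically. The following is a minimal Monte Carlo sketch, not the paper's method: it simulates the spiked model, standardises the largest sample eigenvalue with the standard first-order centring $\ell(1 + \gamma/(\ell - 1))$ and variance $2\ell^2(1 - \gamma/(\ell - 1)^2)/n$ (well-known results for the supercritical spiked model, not stated in the abstract), and reports the empirical skewness that an Edgeworth correction is meant to capture. The function names, sample sizes, and parameter values are illustrative assumptions.

```python
import numpy as np

def largest_eigenvalues(n, p, ell, reps, seed=0):
    """Simulate the largest sample covariance eigenvalue in a one-spike model.

    Hypothetical helper: n Gaussian observations in dimension p + 1, one
    coordinate with variance ell, the remaining p with variance 1.
    """
    rng = np.random.default_rng(seed)
    scale = np.ones(p + 1)
    scale[0] = np.sqrt(ell)  # standard deviation of the spiked component
    out = np.empty(reps)
    for r in range(reps):
        X = rng.standard_normal((n, p + 1)) * scale
        S = X.T @ X / n  # sample covariance (mean known to be zero)
        out[r] = np.linalg.eigvalsh(S)[-1]  # eigvalsh sorts ascending
    return out

def first_order_limit(ell, gamma):
    """Standard first-order centring and n * variance (assumed, not from the abstract)."""
    centre = ell * (1 + gamma / (ell - 1))
    var_n = 2 * ell**2 * (1 - gamma / (ell - 1) ** 2)
    return centre, var_n

# Illustrative values only; chosen so that ell > 1 + sqrt(gamma).
n, p, ell = 400, 100, 3.0
gamma = p / n
assert ell > 1 + np.sqrt(gamma), "supercritical case required"

lam = largest_eigenvalues(n, p, ell, reps=500)
centre, var_n = first_order_limit(ell, gamma)

# Standardise with the Gaussian limit; residual skewness is what a
# skewness (Edgeworth) correction would aim to account for.
z = (lam - centre) / np.sqrt(var_n / n)
skew = np.mean((z - z.mean()) ** 3) / z.std() ** 3
print(f"mean(z) = {z.mean():.3f}, sd(z) = {z.std():.3f}, skewness = {skew:.3f}")
```

If the Gaussian limit were exact at this sample size, the standardised values would have mean 0, standard deviation 1, and skewness 0; departures from these at finite n are the kind of effect the higher-order approximation targets.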
