Bayesian inference for spectral projectors of the covariance matrix

Abstract

Let X1, …, Xn be i.i.d. sample in Rp with zero mean and the covariance matrix *. The classical PCA approach recovers the projector P*J onto the principal eigenspace of * by its empirical counterpart PJ. Recent paper [Koltchinskii, Lounici (2017)] investigated the asymptotic distribution of the Frobenius distance between the projectors \| PJ - P*J \|2, while [Naumov et al. (2017)] offered a bootstrap procedure to measure uncertainty in recovering this subspace P*J even in a finite sample setup. The present paper considers this problem from a Bayesian perspective and suggests to use the credible sets of the pseudo-posterior distribution on the space of covariance matrices induced by the conjugated Inverse Wishart prior as sharp confidence sets. This yields a numerically efficient procedure. Moreover, we theoretically justify this method and derive finite sample bounds on the corresponding coverage probability. Contrary to [Koltchinskii, Lounici (2017), Naumov et al. (2017)], the obtained results are valid for non-Gaussian data: the main assumption that we impose is the concentration of the sample covariance in a vicinity of *. Numerical simulations illustrate good performance of the proposed procedure even on non-Gaussian data in a rather challenging regime.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…