Sample canonical correlation coefficients of high-dimensional random vectors with finite rank correlations

Abstract

Consider two random vectors x ∈ Rp and y ∈ Rq of the forms x=A z+ C11/2 x and y=B z+ C21/2 y, where x∈ Rp, y∈ Rq and z∈ Rr are independent vectors with i.i.d. entries of mean 0 and variance 1, C1 and C2 are p × p and q× q deterministic covariance matrices, and A and B are p× r and q× r deterministic matrices. With n independent observations of ( x, y), we study the sample canonical correlations between x and y. We consider the high-dimensional setting with finite rank correlations. Let t1 t2 ·s tr be the squares of the nontrivial population canonical correlation coefficients, and let λ1 λ2·sλp q be the squares of the sample canonical correlation coefficients. If the entries of x, y and z are i.i.d. Gaussian, then the following dichotomy has been shown in [7] for a fixed threshold tc ∈(0, 1): for 1 i r, if ti < tc, then λi converges to the right-edge λ+ of the limiting eigenvalue spectrum of the sample canonical correlation matrix; if ti>tc, then λi converges to a deterministic limit θi ∈ (λ+,1) determined by ti. In this paper, we prove that these results hold universally under the sharp fourth moment conditions on the entries of x and y. Moreover, we prove the results in full generality, in the sense that they also hold for near-degenerate ti's and for ti's that are close to the threshold tc.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…