Spectral analysis of the Gram matrix of mixture models

Abstract

This text is devoted to the asymptotic study of some spectral properties of the Gram matrix W T W built upon a collection w1, …, wn∈ Rp of random vectors (the columns of W), as both the number n of observations and the dimension p of the observations tend to infinity and are of similar order of magnitude. The random vectors w1, …, wn are independent observations, each of them belonging to one of k classes C1,…, Ck. The observations of each class Ca (1 a k) are characterized by their distribution N(0, p-1Ca), where C1, …, Ck are some non negative definite p× p matrices. The cardinality na of class Ca and the dimension p of the observations are such that nan (1 a k) and pn stay bounded away from 0 and +∞. We provide deterministic equivalents to the empirical spectral distribution of W TW and to the matrix entries of its resolvent (as well as of the resolvent of WW T). These deterministic equivalents are defined thanks to the solutions of a fixed-point system. Besides, we prove that W T W has asymptotically no eigenvalues outside the bulk of its spectrum, defined thanks to these deterministic equivalents. These results are directly used in our companion paper "Kernel spectral clustering of large dimensional data", which is devoted to the analysis of the spectral clustering algorithm in large dimensions. They also find applications in various other fields such as wireless communications where functionals of the aforementioned resolvents allow one to assess the communication performance across multi-user multi-antenna channels.

0

Discussion (0)

Sign in to join the discussion.

Loading comments…