On Empirical Spectral Distributions for Random Tensor Product Models
Abstract
In statistics, assuming samples are independent is reasonable. However, this property can fail to hold for the features, a distinction that has led to several lines of work aiming to remove the latter assumption of independence present in the early literature, while preserving the original conclusions. Empirical spectral distributions of covariance matrices are key for understanding the data, and their almost sure convergence is oftentimes desirable. The random tensor product model, X=(xi1xi2...xid)1 ≤ i1<...<id ≤ n for x1,x2,0.05cm...0.05cm,xn i.i.d., introduced by the machine learning community, has a dependence structure for its features far from trivial and has been studied in recent years. When x1 ∈ R, E[x14]<∞, dn1/3=o(1), the empirical spectral distributions of the covariance matrices were proved to converge almost surely to Marchenko-Pastur laws in the random matrix theory regime. This work extends this result to the range dn1/2=~o(1) when x1 is symmetric with a subgaussian norm slowly growing in n (the aforesaid range arises naturally, and the result failing when dn1/2 ∞ appears to be a plausible claim) and shows that similarly to the case with independent features, the almost sure convergence holds under more general conditions on the covariance structure than the isotropic case. The latter result provides a means of deriving convergence for empirical spectral distributions of random matrices, applicable to other models as well so long as their entries exhibit a certain degree of concentration.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.