On the explanatory power of principal components
Abstract
We show that if we have an orthogonal base (u1,…,up) in a p-dimensional vector space, and select p+1 vectors v1,…, vp and w such that the vectors traverse the origin, then the probability of w being to closer to all the vectors in the base than to v1,…, vp is at least 1/2 and converges as p increases to infinity to a normal distribution on the interval [-1,1]; i.e., (1)-(-1)≈0.6826. This result has relevant consequences for Principal Components Analysis in the context of regression and other learning settings, if we take the orthogonal base as the direction of the principal components.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.