O(k)-Equivariant Dimensionality Reduction on Stiefel Manifolds
Abstract
Many real-world datasets live on high-dimensional Stiefel and Grassmannian manifolds, Vk(RN) and Gr(k, RN) respectively, and benefit from projection onto lower-dimensional Stiefel and Grassmannian manifolds. In this work, we propose an algorithm called Principal Stiefel Coordinates (PSC) to reduce data dimensionality from Vk(RN) to Vk(Rn) in an O(k)-equivariant manner (k ≤ n N). We begin by observing that each element α ∈ Vn(RN) defines an isometric embedding of Vk(Rn) into Vk(RN). Next, we describe two ways of finding a suitable embedding map α: one via an extension of principal component analysis (αPCA), and one that further minimizes data fit error using gradient descent (αGD). Then, we define a continuous and O(k)-equivariant map πα that acts as a "closest point operator" to project the data onto the image of Vk(Rn) in Vk(RN) under the embedding determined by α, while minimizing distortion. Because this dimensionality reduction is O(k)-equivariant, these results extend to Grassmannian manifolds as well. Lastly, we show that παPCA globally minimizes projection error in a noiseless setting, while παGD achieves a meaningfully different and improved outcome when the data does not lie exactly on the image of a linearly embedded lower-dimensional Stiefel manifold as above. Multiple numerical experiments using synthetic and real-world data are performed.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.