An Iterative Algorithm for Differentially Private k-PCA with Adaptive Noise
Abstract
Given n i.i.d. random matrices Ai ∈ Rd × d that share a common expectation , the objective of Differentially Private Stochastic PCA is to identify a subspace of dimension k that captures the largest variance directions of , while preserving differential privacy (DP) of each individual Ai. Existing methods either (i) require the sample size n to scale super-linearly with dimension d, even under Gaussian assumptions on the Ai, or (ii) introduce excessive noise for DP even when the intrinsic randomness within Ai is small. Liu et al. (2022a) addressed these issues for sub-Gaussian data but only for estimating the top eigenvector (k=1) using their algorithm DP-PCA. We propose the first algorithm capable of estimating the top k eigenvectors for arbitrary k ≤ d, whilst overcoming both limitations above. For k=1 our algorithm matches the utility guarantees of DP-PCA, achieving near-optimal statistical error even when n = \!O(d). We further provide a lower bound for general k > 1, matching our upper bound up to a factor of k, and experimentally demonstrate the advantages of our algorithm over comparable baselines.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.