Minimax Lower Bounds for Linear Independence Testing
Abstract
Linear independence testing is a fundamental information-theoretic and statistical problem that can be posed as follows: given n points \(Xi,Yi)\ni=1 from a p+q dimensional multivariate distribution where Xi ∈ Rp and Yi ∈Rq, determine whether aT X and bT Y are uncorrelated for every a ∈ Rp, b∈ Rq or not. We give minimax lower bound for this problem (when p+q,n ∞, (p+q)/n ≤ < ∞, without sparsity assumptions). In summary, our results imply that n must be at least as large as pq/\|XY\|F2 for any procedure (test) to have non-trivial power, where XY is the cross-covariance matrix of X,Y. We also provide some evidence that the lower bound is tight, by connections to two-sample testing and regression in specific settings.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.