Linear Functional Testing with General Loadings in Sparse Regression: Separation Rates and Computational Barriers
Abstract
We study the problem of testing H0: ξβ=t0 in high-dimensional sparse linear regression with Gaussian random design and unknown design covariance. The loading vector ξ is arbitrary, and the exact sparsity level k is unknown but bounded by a known value ku. Tests are required to control Type I error uniformly over the ku-sparse null, while power is evaluated against k-sparse alternatives. We construct a computationally efficient mixed test that gives an upper bound on the adaptive separation distance and establish an information-theoretic lower bound calibrated to the magnitude profile of ξ. In the ultra-sparse regime ku n/ p, these bounds characterize the adaptive separation rate up to logarithmic factors for arbitrary ξ. In the moderately sparse regime n/ p ku n/ p, these bounds match for several classes of loading vectors but may differ in general. In this regime, we further prove a low-degree lower bound that matches the upper bound up to logarithmic factors. This provides evidence that improving on the rate of the mixed test, if statistically possible, may be computationally hard. For flat sparse loadings, we complement this evidence with a polynomial-time reduction from sparse CCA. Finally, we examine how information about the design covariance affects the adaptive separation rate in two settings. Under a sparse signed-spiked covariance model, the information-theoretic lower bound is attainable up to logarithmic factors by a computationally inefficient procedure, while the low-degree lower bound and sparse-CCA reduction continue to apply, providing evidence for a statistical-computational gap. When the design covariance is known and diagonal, the adaptive separation rate takes the same form as in the ultra-sparse regime.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.