Global testing under the sparse alternatives for single index models
Abstract
For the single index model y=f(βτx,ε) with Gaussian design, %satisfying that rank var(E[x y])=1 where f is unknown and β is a sparse p-dimensional unit vector with at most s nonzero entries, we are interested in testing the null hypothesis that β, when viewed as a whole vector, is zero against the alternative that some entries of β is nonzero. Assuming that var(E[x y]) is non-vanishing, we define the generalized signal-to-noise ratio (gSNR) λ of the model as the unique non-zero eigenvalue of var(E[x y]). We show that if s22(p) p is of a smaller order of n, denoted as s22(p) p n, where n is the sample size, one can detect the existence of signals if and only if gSNRp1/2n s(p)n. Furthermore, if the noise is additive (i.e., y=f(βτx)+ε), one can detect the existence of the signal if and only if gSNRp1/2n s(p)n 1n. It is rather surprising that the detection boundary for the single index model with additive noise matches that for linear regression models. These results pave the road for thorough theoretical analysis of single/multiple index models in high dimensions.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.