SLOPE is Adaptive to Unknown Sparsity and Asymptotically Minimax

Abstract

We consider high-dimensional sparse regression problems in which we observe y = X β + z, where X is an n × p design matrix and z is an n-dimensional vector of independent Gaussian errors, each with variance σ2. Our focus is on the recently introduced SLOPE estimator ((Bogdan et al., 2014)), which regularizes the least-squares estimates with the rank-dependent penalty Σ1 i p λi | β|(i), where | β|(i) is the ith largest magnitude of the fitted coefficients. Under Gaussian designs, where the entries of X are i.i.d.~N(0, 1/n), we show that SLOPE, with weights λi just about equal to σ · -1(1-iq/(2p)) (-1(α) is the αth quantile of a standard normal and q is a fixed number in (0,1)) achieves a squared error of estimation obeying \[ \| β\|0 k \,\, P (\| βSLOPE - β \|2 > (1+ε) \, 2σ2 k (p/k) ) 0 \] as the dimension p increases to ∞, and where ε > 0 is an arbitrary small constant. This holds under a weak assumption on the 0-sparsity level, namely, k/p → 0 and (k p)/n → 0, and is sharp in the sense that this is the best possible error any estimator can achieve. A remarkable feature is that SLOPE does not require any knowledge of the degree of sparsity, and yet automatically adapts to yield optimal total squared errors over a wide range of 0-sparsity classes. We are not aware of any other estimator with this property.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…