De-Biasing The Lasso With Degrees-of-Freedom Adjustment
Abstract
This paper studies schemes to de-bias the Lasso in a linear model y=Xβ+ε where the goal is to construct confidence intervals for a0Tβ in a direction a0, where X has iid N(0,) rows. We show that previously analyzed propositions to de-bias the Lasso require a modification in order to enjoy efficiency in a full range of sparsity. This modification takes the form of a degrees-of-freedom adjustment that accounts for the dimension of the model selected by Lasso. Let s0 be the true sparsity. If is known and the ideal score vector proportional to X-1a0 is used, the unadjusted de-biasing schemes proposed previously enjoy efficiency if s0 n2/3. However, if s0 n2/3, the unadjusted schemes cannot be efficient in certain a0: then it is necessary to modify existing procedures by a degrees-of-freedom adjustment. This modification grants asymptotic efficiency for any a0 when s0/p 0 and s0(p/s0)/n 0. If is unknown, efficiency is granted for general a0 when s0 pn+\s pn,\|-1a0\|1 p\|-1/2a0\|2 n\+(s,s0) p n0 where s=\|-1a0\|0, provided that the de-biased estimate is modified with the degrees-of-freedom adjustment. The dependence in s0,s and \|-1a0\|1 is optimal. Our estimated score vector provides a novel methodology to handle dense a0. Our analysis shows that the degrees-of-freedom adjustment is not needed when the initial bias in direction a0 is small, which is granted under stringent conditions on -1. The main proof argument is an interpolation path similar to that typically used to derive Slepian's lemma. It yields a new ∞ error bound for the Lasso which is of independent interest.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.