LASSO risk and phase transition under dependence
Abstract
We consider the problem of recovering a k-sparse signal β0∈Rp from noisy observations y= Xβ0+ w∈Rn. One of the most popular approaches is the l1-regularized least squares, also known as LASSO. We analyze the mean square error of LASSO in the case of random designs in which each row of X is drawn from distribution N(0,) with general . We first derive the asymptotic risk of LASSO in the limit of n,p→∞ with n/p→δ. We then examine conditions on n, p, and k for LASSO to exactly reconstruct β0 in the noiseless case w=0. A phase boundary δc=δ(ε) is precisely established in the phase space defined by 0δ,ε 1, where ε=k/p. Above this boundary, LASSO perfectly recovers β0 with high probability. Below this boundary, LASSO fails to recover β0 with high probability. While the values of the non-zero elements of β0 do not have any effect on the phase transition curve, our analysis shows that δc does depend on the signed pattern of the nonzero values of β0 for general Ip. This is in sharp contrast to the previous phase transition results derived in i.i.d. case with = Ip where δc is completely determined by ε regardless of the distribution of β0. Underlying our formalism is a recently developed efficient algorithm called approximate message passing (AMP) algorithm. We generalize the state evolution of AMP from i.i.d. case to general case with Ip. Extensive computational experiments confirm that our theoretical predictions are consistent with simulation results on moderate size system.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.