The Dantzig selector and sparsity oracle inequalities
Abstract
Let \[Yj=f*(Xj)+j, j=1,...,n,\] where X,X1,...,Xn are i.i.d. random variables in a measurable space (S,A) with distribution and ,1,... ,n are i.i.d. random variables with E=0 independent of (X1,...,Xn). Given a dictionary h1,...,hN:SR, let fλ:=Σj=1Nλjhj, λ=(λ1,...,λN)∈RN. Given >0, define \[:=\ bda∈RN:1≤ k≤ N|n-1Σj=1n l(fλ(Xj)-Yj)hk(Xj)|≤ \\] and \[λ:=λ∈ Arg minλ∈\|λ\|_1.\] In the case where f*:=fλ*,λ*∈ RN, Candes and Tao [Ann. Statist. 35 (2007) 2313--2351] suggested using λ as an estimator of λ*. They called this estimator ``the Dantzig selector''. We study the properties of fλ as an estimator of f* for regression models with random design, extending some of the results of Candes and Tao (and providing alternative proofs of these results).