Bootstrap of residual processes in regression: to smooth or not to smooth ?
Abstract
In this paper we consider a location model of the form Y = m(X) + , where m(·) is the unknown regression function, the error is independent of the p-dimensional covariate X and E()=0. Given i.i.d. data (X1,Y1),…,(Xn,Yn) and given an estimator m(·) of the function m(·) (which can be parametric or nonparametric of nature), we estimate the distribution of the error term by the empirical distribution of the residuals Yi- m(Xi), i=1,…,n. To approximate the distribution of this estimator, Koul and Lahiri (1994) and Neumeyer (2008, 2009) proposed bootstrap procedures, based on smoothing the residuals either before or after drawing bootstrap samples. So far it has been an open question whether a classical non-smooth residual bootstrap is asymptotically valid in this context. In this paper we solve this open problem, and show that the non-smooth residual bootstrap is consistent. We illustrate this theoretical result by means of simulations, that show the accuracy of this bootstrap procedure for various models, testing procedures and sample sizes.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.