A note on an Adaptive Goodness-of-Fit test with Finite Sample Validity for Random Design Regression Models
Abstract
Given an i.i.d. sample \(Xi,Yi)\i ∈ \1 … n\ from the random design regression model Y = f(X) + ε with (X,Y) ∈ [0,1] × [-M,M], in this paper we consider the problem of testing the (simple) null hypothesis f = f0, against the alternative f ≠ f0 for a fixed f0 ∈ L2([0,1],GX), where GX(·) denotes the marginal distribution of the design variable X. The procedure proposed is an adaptation to the regression setting of a multiple testing technique introduced by Fromont and Laurent (2005), and it amounts to consider a suitable collection of unbiased estimators of the L2--distance d2(f,f0) = ∫ [f(x) - f0 (x)]2 d\,GX (x), rejecting the null hypothesis when at least one of them is greater than its (1-uα) quantile, with uα calibrated to obtain a level--α test. To build these estimators, we will use the warped wavelet basis introduced by Picard and Kerkyacharian (2004). We do not assume that the errors are normally distributed, and we do not assume that X and ε are independent but, mainly for technical reasons, we will assume, as in most part of the current literature in learning theory, that |f(x) - y| is uniformly bounded (almost everywhere). We show that our test is adaptive over a particular collection of approximation spaces linked to the classical Besov spaces.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.