On a phase transition in general order spline regression
Abstract
In the Gaussian sequence model Y= θ0 + in Rn, we study the fundamental limit of approximating the signal θ0 by a class (d,d0,k) of (generalized) splines with free knots. Here d is the degree of the spline, d0 is the order of differentiability at each inner knot, and k is the maximal number of pieces. We show that, given any integer d≥ 0 and d0∈\-1,0,…,d-1\, the minimax rate of estimation over (d,d0,k) exhibits the following phase transition: equation* aligned ∈fθθ∈(d,d0, k)Eθ\|θ - θ\|2 d cases k(16n/k), & 2≤ k≤ k0,\\ k(en/k), & k ≥ k0+1. cases aligned equation* The transition boundary k0, which takes the form (d+1)/(d-d0) + 1, demonstrates the critical role of the regularity parameter d0 in the separation between a faster (16n) and a slower (en) rate. We further show that, once encouraging an additional 'd-monotonicity' shape constraint (including monotonicity for d = 0 and convexity for d=1), the above phase transition is eliminated and the faster k(16n/k) rate can be achieved for all k. These results provide theoretical support for developing 0-penalized (shape-constrained) spline regression procedures as useful alternatives to 1- and 2-penalized ones.