Structural Risk Minimization for C1,1(Rd) Regression

Abstract

One means of fitting functions to high-dimensional data is by providing smoothness constraints. Recently, the following smooth function approximation problem was proposed: given a finite set E ⊂ Rd and a function f: E → R, interpolate the given information with a function f ∈ C1, 1(Rd) (the class of first-order differentiable functions with Lipschitz gradients) such that f(a) = f(a) for all a ∈ E, and the value of Lip(∇ f) is minimal. An algorithm is provided that constructs such an approximating function f and estimates the optimal Lipschitz constant Lip(∇ f) in the noiseless setting. We address statistical aspects of reconstructing the approximating function f from a closely-related class C1, 1(Rd) given samples from noisy data. We observe independent and identically distributed samples y(a) = f(a) + (a) for a ∈ E, where (a) is a noise term and the set E ⊂ Rd is fixed and known. We obtain uniform bounds relating the empirical risk and true risk over the class FM = \f ∈ C1, 1(Rd) Lip(∇ f) ≤ M\, where the quantity M grows with the number of samples at a rate governed by the metric entropy of the class C1, 1(Rd). Finally, we provide an implementation using Vaidya's algorithm, supporting our results via numerical experiments on simulated data.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…