Regularization and the small-ball method II: complexity dependent error rates

Abstract

For a convex class of functions F, a regularization functions (·) and given the random data (Xi, Yi)i=1N, we study estimation properties of regularization procedures of the form equation* f ∈ argminf∈ F(1NΣi=1N(Yi-f(Xi))2+λ (f)) equation* for some well chosen regularization parameter λ. We obtain bounds on the L2 estimation error rate that depend on the complexity of the "true model" F*:=\f∈ F: (f)≤(f*)\, where f*∈ argminf∈ FE(Y-f(X))2 and the (Xi,Yi)'s are independent and distributed as (X,Y). Our estimate holds under weak stochastic assumptions -- one of which being a small-ball condition satisfied by F -- and for rather flexible choices of regularization functions (·). Moreover, the result holds in the learning theory framework: we do not assume any a-priori connection between the output Y and the input X. As a proof of concept, we apply our general estimation bound to various choices of , for example, the p and Sp-norms (for p≥1), weak-p, atomic norms, max-norm and SLOPE. In many cases, the estimation rate almost coincides with the minimax rate in the class F*.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…