Approximation and learning of anisotropic and mixed smooth functions by deep ReLU neural networks
Abstract
This paper studies how efficiently deep ReLU neural networks can approximate and learn smooth functions. When the error is measured in Lp([0,1]d) norm and the approximator is a network with width W and depth L, recent works have proven the supper approximation rate O((WL)-2s/d) for Besov space Bsq,r([0,1]d) under the Sobolev embedding condition s/d>1/q-1/p. In order to overcome the curse of dimensionality in this rate, we extent this result to anisotropic and mixed smooth function classes. We establish the approximation rate O((WL)-2s) for anisotropic Besov space Bsq,r([0,1]d) with anisotropic smoothness s=(s1,…,sd) under the embedding condition s > 1/q-1/p, where the mean smoothness s = (Σi=1d si-1)-1. For mixed smooth Besov space MBsq,r([0,1]d) with mixed smoothness s>1/q-1/p, we show that the approximation rate O((WL)-2s) holds up to logarithmic factors. Using these results, we also derive approximation bounds for the composition of anisotropic Besov functions. As an application, it is shown that deep ReLU neural networks can achieve minimax optimal rates up to logarithmic factors for a wide range of smooth function classes.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.