Function approximation by deep neural networks with parameters \0, 12, 1, 2\
Abstract
In this paper it is shown that Cβ-smooth functions can be approximated by deep neural networks with ReLU activation function and with parameters \0, 12, 1, 2\. The l0 and l1 parameter norms of considered networks are thus equivalent. The depth, width and the number of active parameters of the constructed networks have, up to a logarithmic factor, the same dependence on the approximation error as the networks with parameters in [-1,1]. In particular, this means that the nonparametric regression estimation with the constructed networks attains the same convergence rate as with sparse networks with parameters in [-1,1].
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.