Deep Network Approximation Characterized by Number of Neurons
Abstract
This paper quantitatively characterizes the approximation power of deep feed-forward neural networks (FNNs) in terms of the number of neurons. It is shown by construction that ReLU FNNs with width O(\d N1/d,\, N+1\) and depth O(L) can approximate an arbitrary H\"older continuous function of order α∈ (0,1] on [0,1]d with a nearly tight approximation rate O(d N-2α/dL-2α/d) measured in Lp-norm for any N,L∈ N+ and p∈[1,∞]. More generally for an arbitrary continuous function f on [0,1]d with a modulus of continuity ωf(·), the constructive approximation rate is O(d\,ωf( N-2/dL-2/d)). We also extend our analysis to f on irregular domains or those localized in an -neighborhood of a dM-dimensional smooth manifold M⊂eq [0,1]d with dM d. Especially, in the case of an essentially low-dimensional domain, we show an approximation rate O(ωf(1-δddδ+)+d\,ωf(d(1-δ)dδN-2/dδL-2/dδ)) for ReLU FNNs to approximate f in the -neighborhood, where dδ=O(dM (d/δ)δ2) for any δ∈(0,1) as a relative error for a projection to approximate an isometry when projecting M to a dδ-dimensional domain.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.