Deep Neural Networks with ReLU-Sine-Exponential Activations Break Curse of Dimensionality in Approximation on H\"older Class
Abstract
In this paper, we construct neural networks with ReLU, sine and 2x as activation functions. For general continuous f defined on [0,1]d with continuity modulus ωf(·), we construct ReLU-sine-2x networks that enjoy an approximation rate O(ωf(d)·2-M+ωf(dN)), where M,N∈ N+ denote the hyperparameters related to widths of the networks. As a consequence, we can construct ReLU-sine-2x network with the depth 5 and width \2d3/2(3με)1/α,223μ dα/22ε+2\ that approximates f∈ Hμα([0,1]d) within a given tolerance ε >0 measured in Lp norm p∈[1,∞), where Hμα([0,1]d) denotes the H\"older continuous function class defined on [0,1]d with order α ∈ (0,1] and constant μ > 0. Therefore, the ReLU-sine-2x networks overcome the curse of dimensionality on Hμα([0,1]d). In addition to its supper expressive power, functions implemented by ReLU-sine-2x networks are (generalized) differentiable, enabling us to apply SGD to train.