Approximation Rates for Shallow ReLUk Neural Networks on Sobolev Spaces via the Radon Transform
Abstract
Let ⊂ Rd be a bounded domain. We consider the problem of how efficiently shallow neural networks with the ReLUk activation function can approximate functions from Sobolev spaces Ws(Lp()) with error measured in the Lq()-norm. Utilizing the Radon transform and recent results from discrepancy theory, we provide a simple proof of nearly optimal approximation rates in a variety of cases, including when q≤ p, p≥ 2, and s ≤ k + (d+1)/2. The rates we derive are optimal up to logarithmic factors, and significantly generalize existing results. An interesting consequence is that the adaptivity of shallow ReLUk neural networks enables them to obtain optimal approximation rates for smoothness up to order s = k + (d+1)/2, even though they represent piecewise polynomials of fixed degree k.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.