Most Neural Networks Are Almost Learnable

Abstract

We present a PTAS for learning random constant-depth networks. We show that for any fixed ε>0 and depth i, there is a poly-time algorithm that for any distribution on d · Sd-1 learns random Xavier networks of depth i, up to an additive error of ε. The algorithm runs in time and sample complexity of (d)poly(ε-1), where d is the size of the network. For some cases of sigmoid and ReLU-like activations the bound can be improved to (d)polylog(ε-1), resulting in a quasi-poly-time algorithm for learning constant depth random networks.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…