A Near-optimal Algorithm for Learning Margin Halfspaces with Massart Noise
Abstract
We study the problem of PAC learning γ-margin halfspaces in the presence of Massart noise. Without computational considerations, the sample complexity of this learning problem is known to be (1/(γ2 ε)). Prior computationally efficient algorithms for the problem incur sample complexity O(1/(γ4 ε3)) and achieve 0-1 error of η+ε, where η<1/2 is the upper bound on the noise rate. Recent work gave evidence of an information-computation tradeoff, suggesting that a quadratic dependence on 1/ε is required for computationally efficient algorithms. Our main result is a computationally efficient learner with sample complexity (1/(γ2 ε2)), nearly matching this lower bound. In addition, our algorithm is simple and practical, relying on online SGD on a carefully selected sequence of convex losses.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.