A SMART Stochastic Algorithm for Nonconvex Optimization with Applications to Robust Machine Learning
Abstract
In this paper, we show how to transform any optimization problem that arises from fitting a machine learning model into one that (1) detects and removes contaminated data from the training set while (2) simultaneously fitting the trimmed model on the uncontaminated data that remains. To solve the resulting nonconvex optimization problem, we introduce a fast stochastic proximal-gradient algorithm that incorporates prior knowledge through nonsmooth regularization. For datasets of size n, our approach requires O(n2/3/) gradient evaluations to reach -accuracy and, when a certain error bound holds, the complexity improves to O( n2/3(1/)). These rates are n1/3 times better than those achieved by typical, full gradient methods.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.