L1 Regression with Lewis Weights Subsampling
Abstract
We consider the problem of finding an approximate solution to 1 regression while only observing a small number of labels. Given an n × d unlabeled data matrix X, we must choose a small set of m n rows to observe the labels of, then output an estimate β whose error on the original problem is within a 1 + factor of optimal. We show that sampling from X according to its Lewis weights and outputting the empirical minimizer succeeds with probability 1-δ for m > O(12 d d δ). This is analogous to the performance of sampling according to leverage scores for 2 regression, but with exponentially better dependence on δ. We also give a corresponding lower bound of (d2 + (d + 12) 1δ).
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.