Faster and Sample Near-Optimal Algorithms for Proper Learning Mixtures of Gaussians
Abstract
We provide an algorithm for properly learning mixtures of two single-dimensional Gaussians without any separability assumptions. Given O(1/2) samples from an unknown mixture, our algorithm outputs a mixture that is -close in total variation distance, in time O(1/5). Our sample complexity is optimal up to logarithmic factors, and significantly improves upon both Kalai et al., whose algorithm has a prohibitive dependence on 1/, and Feldman et al., whose algorithm requires bounds on the mixture parameters and depends pseudo-polynomially in these parameters. One of our main contributions is an improved and generalized algorithm for selecting a good candidate distribution from among competing hypotheses. Namely, given a collection of N hypotheses containing at least one candidate that is -close to an unknown distribution, our algorithm outputs a candidate which is O()-close to the distribution. The algorithm requires O(N/2) samples from the unknown distribution and O(N N/2) time, which improves previous such results (such as the Scheff\'e estimator) from a quadratic dependence of the running time on N to quasilinear. Given the wide use of such results for the purpose of hypothesis selection, our improved algorithm implies immediate improvements to any such use.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.