The EM Algorithm is Adaptively-Optimal for Unbalanced Symmetric Gaussian Mixtures

Abstract

This paper studies the problem of estimating the means θ*∈Rd of a symmetric two-component Gaussian mixture δ*· N(θ*,I)+(1-δ*)· N(-θ*,I) where the weights δ* and 1-δ* are unequal. Assuming that δ* is known, we show that the population version of the EM algorithm globally converges if the initial estimate has non-negative inner product with the mean of the larger weight component. This can be achieved by the trivial initialization θ0=0. For the empirical iteration based on n samples, we show that when initialized at θ0=0, the EM algorithm adaptively achieves the minimax error rate O(\1(1-2δ*)dn,1\|θ*\|dn,(dn)1/4\) in no more than O(1\|θ*\|(1-2δ*)) iterations (with high probability). We also consider the EM iteration for estimating the weight δ*, assuming a fixed mean θ (which is possibly mismatched to θ*). For the empirical iteration of n samples, we show that the minimax error rate O(1\|θ*\|dn) is achieved in no more than O(1\|θ*\|2) iterations. These results robustify and complement recent results of Wu and Zhou obtained for the equal weights case δ*=1/2.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…