Two statistical problems for multivariate mixture distributions
Abstract
We address two important statistical problems: that of estimating mixtures of multivariate normal distributions and mixtures of t-distributions based on univariate projections, and that of quantifying a discrepancy between mixture distributions induced by two model-based clusterings. In the second problem, rather than introducing a direct metric on partitions, we propose a model-based distributional discrepancy between the fitted mixture distributions associated with two clusterings. The results are based on an earlier work of the authors, where it was shown that mixtures of multivariate Gaussian or t-distributions can be distinguished by projecting them onto a certain predetermined finite set of lines, the number of lines depending only on the total number of distributions involved and on the ambient dimension. We also compare our proposal with robust versions of the expectation-maximization method EM. In each case, we present algorithms for effecting the task, and compare them with existing methods by carrying out some simulations.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.