Mean and Variance Estimation Complexity in Arbitrary Distributions via Wasserstein Minimization
Abstract
Parameter estimation is a fundamental challenge in machine learning, crucial for tasks such as neural network weight fitting and Bayesian inference. This paper focuses on the complexity of estimating translation μ ∈ Rl and shrinkage σ ∈ R++ parameters for a distribution of the form 1σl f0 ( x - μσ ), where f0 is a known density in Rl given n samples. We highlight that while the problem is NP-hard for Maximum Likelihood Estimation (MLE), it is possible to obtain -approximations for arbitrary > 0 within poly ( 1 ) time using the Wasserstein distance.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.