Characterization of Gaussian Universality Breakdown in High-Dimensional Empirical Risk Minimization
Abstract
We study high-dimensional convex empirical risk minimization (ERM) under general non-Gaussian data designs. By heuristically extending the Convex Gaussian Min-Max Theorem (CGMT) to non-Gaussian settings, we derive an asymptotic min-max characterization of key statistics, enabling approximation of the mean μθ and covariance Cθ of the ERM estimator θ. Specifically, under a concentration assumption on the data matrix and standard regularity conditions on the loss and regularizer, we show that for a test covariate x independent of the training data, the projection θ x approximately follows the convolution of the generally non-Gaussian distribution of μθ x with an independent centered Gaussian variable of variance tr(Cθ E[xx]). This result clarifies the scope and limits of Gaussian universality for ERMs. Additionally, we prove that any C2 regularizer is asymptotically equivalent to a quadratic form determined solely by its Hessian at zero and gradient at μθ. Numerical simulations across diverse losses and models are provided to validate our theoretical predictions and qualitative insights.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.