Characterization of Gaussian Universality Breakdown in High-Dimensional Empirical Risk Minimization

Abstract

We study high-dimensional convex empirical risk minimization (ERM) under general non-Gaussian data designs. By heuristically extending the Convex Gaussian Min-Max Theorem (CGMT) to non-Gaussian settings, we derive an asymptotic min-max characterization of key statistics, enabling approximation of the mean μθ and covariance Cθ of the ERM estimator θ. Specifically, under a concentration assumption on the data matrix and standard regularity conditions on the loss and regularizer, we show that for a test covariate x independent of the training data, the projection θ x approximately follows the convolution of the generally non-Gaussian distribution of μθ x with an independent centered Gaussian variable of variance tr(Cθ E[xx]). This result clarifies the scope and limits of Gaussian universality for ERMs. Additionally, we prove that any C2 regularizer is asymptotically equivalent to a quadratic form determined solely by its Hessian at zero and gradient at μθ. Numerical simulations across diverse losses and models are provided to validate our theoretical predictions and qualitative insights.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…