When Does 2-Boosting Overfit Benignly? High-Dimensional Risk Asymptotics and the 1 Implicit Bias

Abstract

Benign overfitting is well-characterized in 2 geometries, but its behavior under the 1 implicit bias of greedy ensembles remains challenging. The analytical barrier stems from the non-linear coupling of coordinate selection thresholds, which invalidates standard spectral resolvent tools. To isolate this algorithmic bias, we characterize the high-dimensional risk of continuous-time 2-Boosting over p features and n samples. By coupling the Convex Gaussian Minimax Theorem with delicate asymptotic expansions of double-sided truncated Gaussian moments, we analytically resolve the non-smooth 1 interpolant. Under an isotropic pure-noise model, we prove that benign overfitting fails at the linear rate: greedy selection localizes noise into sparse active sets, and the excess variance decays at a logarithmic rate (σ2/(p/n)) for noise variance σ2. We remark that while this localization mechanism should persist in the presence of signals, the exact signal-noise decomposition remains an open problem. For spiked-isotropic designs with k* head eigenvalues and r2 = p - k* tail dimensions, the risk converges to zero when r2 n, but only at a logarithmic rate (σ2/(r2/n)), which is slower than the linear decay observed in 2 geometries. To avoid this slow convergence, we analyze the non-smooth subdifferential dynamics of the boosting flow. This yields a tuning-free early stopping rule that, under a bounded 1-path condition, recovers the Lasso basic inequality and attains the minimax-optimal empirical prediction rate for 1-bounded signals.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…