Optimal Bias-variance Tradeoff in Matrix and Tensor Estimation
Abstract
We study matrix and tensor denoising when the underlying signal is not necessarily low-rank. In the tensor setting, we observe \[ Y = X + Z ∈ Rp1 × p2 × p3, \] where X is an unknown signal tensor and Z is a noise tensor. We propose a one-step variant of the higher-order SVD (HOSVD) estimator, denoted X, and show that, uniformly over any user-specified Tucker ranks (r1,r2,r3), with high probability, \[ \| X - X\| F2 = O( 2\r1r2r3 + Σk=13 pk rk\ + (r1,r2,r3)2 ). \] Here, (r1,r2,r3) is the best achievable Tucker rank-(r1,r2,r3) approximation error of X (bias), 2 quantifies the noise level, and 2\r1r2r3+Σk=13 pk rk\ is the variance term scaling with the effective degrees of freedom of X. This yields a rank-adaptive bias-variance tradeoff: increasing (r1,r2,r3) decreases the bias (r1,r2,r3) while increasing variance. In the matrix setting, we show that truncated SVD achieves an analogous bias-variance tradeoff for arbitrary signal matrices. Notably, our matrix result requires no assumptions on the signal matrix, such as finite rank or spectral gaps. Finally, we complement our upper bounds with matching information-theoretic lower bounds, showing that the resulting bias-variance tradeoff is minimax optimal up to universal constants in both the matrix and tensor settings.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.