Optimal prediction in the linearly transformed spiked model

Abstract

We consider the linearly transformed spiked model, where observations Yi are noisy linear transforms of unobserved signals of interest Xi: align* Yi = Ai Xi + i, align* for i=1,…,n. The transform matrices Ai are also observed. We model Xi as random vectors lying on an unknown low-dimensional space. How should we predict the unobserved signals (regression coefficients) Xi? The naive approach of performing regression for each observation separately is inaccurate due to the large noise. Instead, we develop optimal linear empirical Bayes methods for predicting Xi by "borrowing strength" across the different samples. Our methods are applicable to large datasets and rely on weak moment assumptions. The analysis is based on random matrix theory. We discuss applications to signal processing, deconvolution, cryo-electron microscopy, and missing data in the high-noise regime. For missing data, we show in simulations that our methods are faster, more robust to noise and to unequal sampling than well-known matrix completion methods.

0

Discussion (0)

Sign in to join the discussion.

Loading comments…