Equispaced Fourier representations for efficient Gaussian process regression from a billion data points
Abstract
We introduce a Fourier-based fast algorithm for Gaussian process regression in low dimensions. It approximates a translationally-invariant covariance kernel by complex exponentials on an equispaced Cartesian frequency grid of M nodes. This results in a weight-space M× M system matrix with Toeplitz structure, which can thus be applied to a vector in O(M M) operations via the fast Fourier transform (FFT), independent of the number of data points N. The linear system can be set up in O(N + M M) operations using nonuniform FFTs. This enables efficient massive-scale regression via an iterative solver, even for kernels with fat-tailed spectral densities (large M). We provide bounds on both kernel approximation and posterior mean errors. Numerical experiments for squared-exponential and Mat\'ern kernels in one, two and three dimensions often show 1-2 orders of magnitude acceleration over state-of-the-art rank-structured solvers at comparable accuracy. Our method allows 2D Mat\'ern-32 regression from N=109 data points to be performed in 2 minutes on a standard desktop, with posterior mean accuracy 10-3. This opens up spatial statistics applications 100 times larger than previously possible.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.