Arithmetic functions and learning theory
Abstract
We establish a connection between analytic number theory and computational learning theory by showing that the M\"obius function belongs to a class of functions that is statistically hard to learn from random samples. Let μR denote the restriction of the M\"obius function to the squarefree integers in \1,…,R\. Using a recent lower bound of Pandey and Radziwi for the L1 norm of exponential sums with M\"obius coefficients, we prove that \[ (μR) R-1/4-ε \] for every ε>0. We then show that, for a suitable absolute constant c0>0, the class of \-1,1\-valued functions on the squarefree integers with Fourier Ratio at least c0 has Vapnik--Chervonenkis dimension at least cR. It follows that any distribution-independent learning algorithm that succeeds uniformly on the class HR(ηR) containing μR, where ηR 0, requires at least (R) samples. We also discuss a conditional improvement under a strong uniform bound for additive twists of the M\"obius function, and we note that the same method applies to the Liouville function.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.