On a Central Limit Theorem and Sanov's principle for quantum neural networks
Abstract
In this work, we study the fluctuations of a Mixture of Experts (MoE) generated by a quantum neural network trained via gradient flow on supervised learning problems. Our main results establish the Central Limit Theorem (CLT), and Sanov's principle for an MoE as the number of experts diverges. We demonstrate that the fluctuations of the empirical measure of its parameters close to its corresponding limit probability measure solve a linear transport equation. As a byproduct, we show that the MoE converges to a limit function which solves an evolution equation governed by the neural tangent kernel associated with the quantum neural network.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.