Innovation Capacity of Dynamical Learning Systems
Abstract
In noisy physical reservoirs, the classical information-processing capacity Cip quantifies how well a linear readout can realize tasks measurable from the input history, yet Cip can be far smaller than the observed rank of the readout covariance. We explain this ``missing capacity'' by introducing the innovation capacity Ci, the total capacity allocated to readout components orthogonal to the input filtration (Doob innovations, including input-noise mixing). Using a basis-free Hilbert-space formulation of the predictable/innovation decomposition, we prove the conservation law Cip+Ci=rank(XX) d, so predictable and innovation capacities exactly partition the rank of the observable readout dimension covariance XX∈ R d× d. In linear-Gaussian Johnson-Nyquist regimes, XX(T)=S+T N0, the split becomes a generalized-eigenvalue shrinkage rule and gives an explicit monotone tradeoff between temperature and predictable capacity. Geometrically, in whitened coordinates the predictable and innovation components correspond to complementary covariance ellipsoids, making Ci a trace-controlled innovation budget. A large Ci forces a high-dimensional innovation subspace with a variance floor and under mild mixing and anti-concentration assumptions this yields extensive innovation-block differential entropy and exponentially many distinguishable histories. Finally, we give an information-theoretic lower bound showing that learning the induced innovation-block law in total variation requires a number of samples that scales with the effective innovation dimension, supporting the generative utility of noisy physical reservoirs.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.