Online Realizable Regression and Applications for ReLU Networks
Abstract
Realizable online regression can behave very differently from online classification. Even without any margin or stochastic assumptions, realizability may enforce horizon-free (finite) cumulative loss under metric-like losses, even when the analogous classification problem has an infinite mistake bound. We study realizable online regression in the adversarial model under losses that satisfy an approximate triangle inequality (approximate pseudo-metrics). Recent work of Attias et al. shows that the minimax realizable cumulative loss is characterized by the scaled Littlestone/online dimension Donl, but this quantity can be difficult to analyze. Our main technical contribution is a generic potential method that upper bounds Donl by a concrete Dudley-type entropy integral that depends only on covering numbers of the hypothesis class under the induced sup pseudo-metric. We define an entropy potential Φ(H)=∫0diam(H) N(H,)\,d, where N(H,) is the -covering number of H, and show that for every c-approximate pseudo-metric loss, Donl(H) O(c)\,Φ(H). In particular, polynomial metric entropy implies Φ(H)<∞ and hence a horizon-free realizable cumulative-loss bound with transparent dependence on effective dimension. We illustrate the method on two families. We prove a sharp q-vs.-d dichotomy for realizable online learning (finite and efficiently achievable Θd,q(Ld) total loss for L-Lipschitz regression iff q>d, otherwise infinite), and for bounded-norm k-ReLU networks separate regression (finite loss, even O(k2), and O(1) for one ReLU) from classification (impossible already for k=2,d=1).
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.