Online learning of smooth functions on R
Abstract
We study adversarial online learning of real-valued functions on R. In each round the learner is queried at xt∈R, predicts yt, and then observes the true value f(xt); performance is measured by cumulative p-loss Σt 1| yt-f(xt)|p. For the class \[ Gq=\f:R\ absolutely continuous:\ ∫R|f'(x)|q\,dx 1\, \] we show that the standard model becomes ill-posed on R: for every p 1 and q>1, an adversary can force infinite loss. Motivated by this obstruction, we analyze three modified learning scenarios that limit the influence of queries that are far from previously observed inputs. In Scenario 1 the adversary must choose each new query within distance 1 of some past query. In Scenario 2 the adversary may query anywhere, but the learner is penalized only on rounds whose query lies within distance 1 of a past query. In Scenario 3 the loss in round t is multiplied by a weight g(j<t|xt-xj|). We obtain sharp characterizations for Scenarios 1-2 in several regimes. For Scenario 3 we identify a clean threshold phenomenon: if g decays too slowly, then the adversary can force infinite weighted loss. In contrast, for rapidly decaying weights such as g(z)=e-cz we obtain finite and sharp guarantees in the quadratic case p=q=2. Finally, we study a natural multivariable slice generalization Gq,d of Gq on Rd and show a sharp dichotomy: while the one-dimensional case admits finite opt-values in certain regimes, for every d 2 the slice class Gq,d is too permissive, and even under Scenarios 1-3 an adversary can force infinite loss.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.