Improved Space Bounds for Learning with Experts
Abstract
We give improved tradeoffs between space and regret for the online learning with expert advice problem over T days with n experts. Given a space budget of nδ for δ ∈ (0,1), we provide an algorithm achieving regret O(n2 T1/(1+δ)), improving upon the regret bound O(n2 T2/(2+δ)) in the recent work of [PZ23]. The improvement is particularly salient in the regime δ → 1 where the regret of our algorithm approaches On(T), matching the T dependence in the standard online setting without space restrictions.
0
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.