The Price of Differential Privacy For Online Learning

Karan Singh

The Price of Differential Privacy For Online Learning

Abstract

We design differentially private algorithms for the problem of online linear optimization in the full information and bandit settings with optimal O(T) regret bounds. In the full-information setting, our results demonstrate that ε-differential privacy may be ensured for free -- in particular, the regret bounds scale as O(T)+O(1ε). For bandit linear optimization, and as a special case, for non-stochastic multi-armed bandits, the proposed algorithm achieves a regret of O(1εT), while the previously known best regret bound was O(1εT23).

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Or compile a full topic from this idea

Discussion (0)

Sign in to join the discussion.

Loading comments…