On the Minimax Regret for Linear Bandits in a wide variety of Action Spaces

Aditya Gopalan

On the Minimax Regret for Linear Bandits in a wide variety of Action Spaces

Abstract

As noted in the works of lattimore2020bandit, it has been mentioned that it is an open problem to characterize the minimax regret of linear bandits in a wide variety of action spaces. In this article we present an optimal regret lower bound for a wide class of convex action spaces.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Or open the topic learn hub

Discussion (0)

Sign in to join the discussion.

Loading comments…