On the Optimal Amount of Experimentation in Sequential Decision Problems

Abstract

We provide a tight bound on the amount of experimentation under the optimal strategy in sequential decision problems. We show the applicability of the result by providing a bound on the cut-off in a one-arm bandit problem.

0

Discussion (0)

Sign in to join the discussion.

Loading comments…