One step futher: an explicit solution to Robbins' problem when n=4
Abstract
Fix some n ∈ N and let X1, X2,…, Xn be independent random variables drawn from the uniform distribution on [0,1]. A decision maker is shown the variables sequentially and, after each observation, must decide whether or not to keep the current one, with payoff the overall rank of the selected observation. Decisions are final: no recall is allowed, no regret is tolerated. The objective is to act in such a way as to minimise the expected payoff. In this note we give the explicit solution to this problem, known as Robbins' problem of optimal stopping, when n=4.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.