Stopping on the last success with unknown odds: Impossibility barriers and quantitative oracle bounds
Abstract
We consider the classical last-success problem for sequential Bernoulli trials in the homogeneous setting where X1,…,Xn are i.i.d. Bernoulli(p) but the success probability p∈(0,1) is unknown to the decision maker. When p is known, Bruss' sum-the-odds theorem yields an optimal threshold rule with value Vn(p). We study a natural oracle-free plug-in rule that replaces p by the online empirical estimate pt and we denote its win probability by Wn(p). First, we derive an exact expression for Wn(p) via a recursion for the state probabilities, enabling explicit comparisons with Vn(p) and revealing a finite-horizon separation between plug-in and oracle performance. Next, we formalize a first decision-theoretic obstruction inherent to the unknown-p formulation: for every fixed n2, the dominance partial order on p-blind (possibly randomized) rules has no greatest element. We then identify regimes where oracle-freeness is achievable with sharp bounds. For any p0∈(0,1), we establish finite-horizon oracle bounds on [p0,1) and we prove a matching minimax lower bound of order 1/n for p0∈(0, 12) (larger values of p0 do not allow for non-trivial lower bounds). We also show that the rate is exponential for any fixed p. In sparse regimes where p=pn0 with npn∞, we prove asymptotic oracle-optimality of the plug-in rule, in the sense that Vn(pn)-Wn(pn)0. Together with our non-sparse bounds, this yields a broad uniform convergence guarantee, which we show cannot be extended to the critical regime p 1/n. Finally, we establish another impossibility barrier: even allowing randomization, no oracle-free sequence of rules can converge uniformly to the oracle value over p∈(0,1).
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.