Online Sampling and Decision Making with Low Entropy

Abstract

Consider the following problem. We are given $n$ boxes, labeled $\{1, 2, \ldots, n\}$ by an adversary, each containing a single number drawn from an unknown distribution; these $n$ distributions are not necessarily identical. We are also given an integer $k \le n$. We must choose an order in which to open the boxes sequentially, and each time we open the next box in this order, we learn the number it contains. Once we reject a number, its box cannot be recalled. Our goal is to accept $k$ of these numbers, without necessarily opening all boxes, such that the accepted numbers are the $k$ largest numbers in the boxes (so that their sum is maximized). A natural approach to such problems is to use randomness to sample the elements in random order; however, as indicated in several sources, e.g., Turan et al. NIST'15 and Bierhorst et al. Nature'18, pure randomness is hard to obtain in practice. We present an algorithm for this problem that is provably and simultaneously near-optimal with respect to both the competitive ratio achieved and the amount of randomness used. In particular, we construct a distribution on the orders with entropy $\Theta(\log\log n)$ such that a deterministic multiple-threshold algorithm gives a competitive ratio of $1 - O(\sqrt{\log k / k})$, for $k < \log n / \log\log n$. Our algorithm's competitive ratio is optimal and it simultaneously uses optimal entropy $\Theta(\log\log n)$, improving in three ways on the previous best known algorithm, whose competitive ratio is $1 - O(1/k^{1/3}) - o(1)$.
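To make the setting concrete, the following is a minimal Python sketch of the problem interface together with a simple single-threshold baseline. It is not the algorithm of this paper: the abstract does not specify the low-entropy distribution on orders or the multiple thresholds, so the sketch assumes a uniformly random order and one threshold taken from an observation-only phase. The names `sample_then_threshold`, `empirical_ratio`, `uniform_order_entropy_bits`, and the parameter `sample_frac` are hypothetical and introduced only for illustration. The last function computes the entropy of a uniformly random order, $\log_2(n!)$ bits, which is the quantity a low-entropy distribution on orders aims to reduce.

```python
import heapq
import math
import random


def sample_then_threshold(values, order, k, sample_frac=0.5):
    """Open boxes in `order` and return the accepted values (at most k).

    Illustrative baseline only (assumes k >= 1): observe the first
    `sample_frac` fraction of boxes without accepting, then accept every
    later value that beats the k-th largest value seen in the sample.
    """
    n = len(order)
    m = int(sample_frac * n)  # size of the observation-only phase
    sample = [values[i] for i in order[:m]]
    # Threshold: k-th largest sampled value, or -inf if the sample is too small.
    threshold = heapq.nlargest(k, sample)[-1] if len(sample) >= k else -math.inf
    accepted = []
    for i in order[m:]:
        if len(accepted) == k:
            break  # stop early; the remaining boxes stay unopened
        if values[i] > threshold:
            accepted.append(values[i])
    return accepted


def empirical_ratio(values, order, k, sample_frac=0.5):
    """Sum of accepted values divided by the sum of the k largest values."""
    best = sum(heapq.nlargest(k, values))
    return sum(sample_then_threshold(values, order, k, sample_frac)) / best


def uniform_order_entropy_bits(n):
    """Entropy in bits of a uniformly random order of n boxes: log2(n!)."""
    return math.lgamma(n + 1) / math.log(2)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the run is reproducible
    n, k = 1000, 10
    values = [rng.random() for _ in range(n)]
    order = list(range(n))
    rng.shuffle(order)  # assumption: uniformly random order
    print("empirical ratio:", empirical_ratio(values, order, k))
    print("entropy of a uniform order (bits):", uniform_order_entropy_bits(n))
```

A single run gives only one empirical ratio on one input; the competitive ratio in the abstract is a worst-case guarantee in expectation over the order, so this sketch is useful for intuition about the interface rather than for reproducing the stated bounds.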
