A PTAS for a Class of Stochastic Dynamic Programs

Abstract

We develop a framework for obtaining polynomial time approximation schemes (PTAS) for a class of stochastic dynamic programs. Using our framework, we obtain the first PTAS for the following stochastic combinatorial optimization problems: : We are given a set of n items, each item i∈ [n] has a value Xi which is an independent random variable with a known (discrete) distribution πi. We can probe a subset P⊂eq [n] of items sequentially. Each time after probing an item i, we observe its value realization, which follows the distribution πi. We can adaptively probe at most m items and each item can be probed at most once. The reward is the maximum among the m realized values. Our goal is to design an adaptive probing policy such that the expected value of the reward is maximized. To the best of our knowledge, the best known approximation ratio is 1-1/e, due to Asadpour ηl~asadpour2015maximizing. We also obtain PTAS for some generalizations and variants of the problem and some other problems.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…