Adaptive Regularized Submodular Maximization

Abstract

In this paper, we study the problem of maximizing the difference between an adaptive submodular (revenue) function and an non-negative modular (cost) function under the adaptive setting. The input of our problem is a set of n items, where each item has a particular state drawn from some known prior distribution p. The revenue function g is defined over items and states, and the cost function c is defined over items, i.e., each item has a fixed cost. The state of each item is unknown initially, one must select an item in order to observe its realized state. A policy π specifies which item to pick next based on the observations made so far. Denote by gavg(π) the expected revenue of π and let cavg(π) denote the expected cost of π. Our objective is to identify the best policy πo∈ πgavg(π)-cavg(π) under a k-cardinality constraint. Since our objective function can take on both negative and positive values, the existing results of submodular maximization may not be applicable. To overcome this challenge, we develop a series of effective solutions with performance grantees. Let πo denote the optimal policy. For the case when g is adaptive monotone and adaptive submodular, we develop an effective policy πl such that gavg(πl) - cavg(πl) ≥ (1-1e-ε)gavg(πo) - cavg(πo), using only O(nε-2 ε-1) value oracle queries. For the case when g is adaptive submodular, we present a randomized policy πr such that gavg(πr) - cavg(πr) ≥ 1egavg(πo) - cavg(πo).

0

Discussion (0)

Sign in to join the discussion.

Loading comments…